Chicken embryo lethal orphan (CELO) virus

ABSTRACT

A CELO virus obtained by in vitro manipulation of a plasmid-cloned CELO virus DNA is suitable for the production of vectors for gene therapy and as a vaccine against infectious diseases in humans and animals, particularly birds.

[0001] The invention relates to adenoviruses. The large family ofadenoviruses is subdivided according to its host into adenoviruses whichinfect mammals (the mastadenoviridae) and adenoviruses which infectbirds (the aviadenoviridae). The CELO virus (Chicken Embryo LethalOrphan; article by Mcferran, et al., 1977; McCracken and Adair, 1993)was identified as an infectious agent in 1957 (Yates and Fry, 1957).CELO virus is classified as a poultry adenovirus type 1 (FAV-1) andfirst aroused interest because of its property of being tumorigenic inbaby hamsters. However, since infection with the CELO virus does nothave any serious health and economic consequences, the interest in thisvirus disappeared in recent years. The FAV-1 adenoviruses can beisolated from healthy chickens and do not cause any disease whenreintroduced experimentally into chickens (Cowen, et al., 1978). Theirisolation from sick birds is presumably the result of adenovirusreplication in a host which has an immune system weakened by otherinfluences.

[0002] The general structural organisation of CELO virus, with anicosahedral capsid of 70-80 nm, made up of hexon and penton structures,is similar to that of the mammalian adenoviruses (Layer, et al., 1971).The CELO virus genome is a linear, double-stranded DNA molecule, the DNAbeing condensed inside the virion by virus-coded core proteins (Layer etal., 1971; Li, et al., 1984b). The CELO virus genome has covalentlybound terminal proteins (Li, et al., 1983) and the genome has invertedterminal repeats (ITRs), although they are shorter than the mammalianITRs (Aleström, et al., 1982b; Sheppard and Trist, 1992). The CELO viruscodes a protease with 61-69% homology for the mammalian adenovirusproteases (Cai and Weber, 1993).

[0003] There are significant differences between CELO virus and themastadenoviruses. CELO virus has a larger genome, with sequence homologywith Ad5 which can only be detected in two short regions of the CELOvirus genome (by hybridisation) (Aleström, et al., 1982a). The CELOvirion has been reported to have two fibres of different lengths at eachvertex. The CELO virus cannot complement the E1A functions of Ad5, andthe replication of CELO virus is not made easier by the activity ofAd5E1 (Li, et al., 1984c).

[0004] Within the scope of the present invention, total sequenceanalysis of the CELO virus was carried out; on the one hand because itis useful for understanding the biology of adenoviruses to clarify thegenomic organisation of an adenovirus which is very remote from themammalian adenoviruses generally studied. Since the conditions fortransmission and survival for a virus which infects a type of bird arepresumably different than for mammalian viruses, it is possible that thebird adenoviruses have acquired new virus functions or exhibit a higherdegree of variability than the mastadenoviridae. The complete CELO virussequence also permits changes in the CELO virus genome with respect tofunctional analysis.

[0005] Since adenovirus vectors have proved highly effective vectors forgene transfer (see the summarising article by Graham, 1990; Kozarsky andWilson, 1993; Trapnell and Gorziglia, 1994), the complete CELO virussequence, on the other hand, is particularly interesting as the basisfor preparing new recombinant vectors for gene transfer.

[0006] Sequence analysis has shown that the CELO virus genome has 43.8kb, being more than 8 kb longer than the human subtypes Ad2 and Ad5. Thegenes for the main structural proteins (hexon, penton based, IIIa,fibres, pVI, pVII, pVIII) are on the one hand both present and alsolocated at the corresponding sites in the genome. The genes of the earlyregion 2 (E2; DNA binding protein, DNA polymerase and terminal protein)are also present. However, the CELO virus lacks sequences homologous tothe regions E1, E3 and E4 of the mammalian adenoviruses.

[0007] There are approximately 5 kb at the left hand end and 15 kb atthe right hand end of the CELO virus genome, where there is onlyrestricted homology or no homology at all with the mastadenovirusgenomes. These new sequences contain a number of open reading frames,and it can be assumed that these code for functions which replace themissing E1, E3 and possibly E4 regions.

[0008] Parts of the CELO virus sequence have already been published;they are listed in Table 1, as are the differences between the sequenceknown from the databank and the sequence determined within the scope ofthe present invention. From studies concentrating on specific viralgenes, a homolog of the VA RNA gene of mastadenovirus was known(Larsson, et al., 1986) and part of the genome sequence which carriesthe endoprotease has been described (Cai and Weber, 1993). In addition,fragments of the CELO virus genome have been published (Akopian, et al.,1990; Akopian, et al., 1992; Hess, et al., 1995). The sequence of thepenton base of the related virus FAV-10 has also been reported (Sheppardand Trist, 1992). Some other sequence fragments have been deposited inthe databank and are also shown in Table 1. In all, about 50% of theCELO virus genome is available in the form of fragments (total about 24kb). The sequence obtained within the scope of the present invention iscomplete and has the advantage of having been obtained from a singleisolated material.

[0009] The total sequence of the CELO virus is shown in the sequencelisting (in the sequence listing the word “complementary” indicates thatthe open reading frames are present in the reverse arrangement). Itshows a large number of striking differences between Ad2 and the CELOvirus. The organisation of the recognisable open reading frames (ORFs)of the CELO virus genome based on the sequence analysis, compared withAd2, is shown in FIG. 1A: the Figure shows an overview of the genomicorganisation of Ad2/5 and CELO virus. The arrows indicate the positionof the coding regions but not the exact cleavage patterns of the geneproducts. The pattern of the CELO virus also (in the first 6,000 bp andin the last 13,000 bp) gives all the non-associated open reading frameswhich begin with a methionine and which code for more than 99 amino acidgroups. The central region of the two genomes which show homology on thebasis of dot matrix analysis (cf. FIG. 3) and the regions at the ends ofthe CELO virus genome which have no homology with other adenoviruses(“unique to CELO”) are given. The abbreviations in the Figure, whichalso correspond to those in the Tables, have the following meanings: PB,penton base; EP, endoproteinase; DBP, DNA binding protein; bTP,pre-terminal protein; pol, DNA polymerase.

[0010] The sequenced CELO virus genome has a length of 43,804 bp and hasa content of G+C of 54.3%. It had already been presumed at an earlierstage that the CELO virus genome is much larger than the mastadenovirusgenome with 34-36 kb; it has been found that the CELO virus DNA has aweight of 30×10⁶ Daltons, determined according to its sedimentationcoefficient (Layer, et al., 1971), compared with 24×10⁶ Daltons for Ad2(Green, et al., 1967). The size of the CELO virus genome determined bythe addition of the restriction fragments is about 43 kb (Cai and Weber,1993; Denisova, et al., 1979). A Pulsed Field Gel Analysis of the CELOvirus genome isolated from purified virions is shown in FIG. 2A and iscompared with the DNA isolated from Ads dl1014 (34,600 bp; Bridge andKetner, 1989) or Wild-type Ad5-virions (35,935 bp; vt300; Chroboczek, etal., 1992; Jones and Shenk, 1978); a mixture of uncleaved bacteriophageλ-DNA and γ-DNA cleaved with five different restriction enzymes (Biorad)was used as the size marker (tracks 1 and 7 show the molecular weightmarkers, track 2 shows the DNA of Ad5 dl1014, track 3 shows the DNA ofAd5 wt300, track 4 shows the CELO virus DNA, track 5 shows the DNA ofOTE, track 6 shows the DNA of Indiana C). FIG. 2A shows that the CELOvirus genome has a length of 44 kb. From this analysis it is clear thatthe CELO virus genome is actually substantially larger than the genomeof the mammalian virus. Calculations based on the migration of fragmentsof the lambda bacteriophage give a size of 43 kb for the CELO virusgenome. The DNA extracted from two other FAV-1 isolates, Indiana C andOTE, co-migrates with the CELO virus species, which is further evidenceof the size of the CELO virus genome. FIG. 2B shows that the CELO virussequence contained in the bacterial plasmid pBR327 has the same size.

[0011] There is no identifiable E1 region. No significant homology couldbe found between the CELO virus genome and the first 4,000 bp of Ad2.There are a few small open reading frames in the first 5,000 bp of CELOvirus which might possibly perform some of the E1 tasks. An open readingframe at the right hand end of the virus genome (GAM-1) may replace E1B19K in functional assays without there being any significant homologybetween GAM-1 and E1B 19K. In order to confirm that the left hand endoriginates from the Wild type CELO virus genome and is not the sequenceof a cloned variant, various tests were carried out; comparison of thedirect sequence analysis of CELO virions at three different sites withthe corresponding sites of the cloned sequences; Southern analyses withDNA from various virus isolates which yielded the same restrictionfragments; pulsed field gel electrophoresis of various virus genomeswhich showed no heterogenicity.

[0012] There is no identifiable E3 region; the two small open readingframes in the corresponding region of the CELO virus have no significanthomology with the E3 functions described.

[0013] There is a group of small open reading frames between 36,000 and31,000 the position of which indicates the mammalian virus E4 region,but with additional 8 kb sequence at the right hand end of the CELOvirus.

[0014] Nor was any sequence resembling protein IX identified (protein IXis essential for the hexon-hexon interactions and the stability of themammalian adenovirus virions).

[0015] A protein V gene was not identified either.

[0016] The following regions are conserved between CELO virus and Ad2:the central part of the CELO virus genome, from the IVa2 gene(approximately from nucleotide (nt) 5,000) on the left hand strand up tothe fibre genes on the right hand strand (approximately up to nt 33,000)is organised as in the mastadenoviruses, and the majority of theimportant viral genes can be identified both by their position and bysequence homology. Earlier studies on the homology between CELO and Ad2(Aleström, et al., 1982a) showed two regions of the CELO virus whichcross-hybridise with the Ad2 sequence. These two fragments are nt 5,626to 8,877 (coding for IVa2 and the carboxy terminus of DNA polymerase)and nt 17,881 to 21,607 (coding for the hexon). The dot matrix analysisshown in FIG. 3 (carried out using the UWGCG program Compare with awindow of 30 and a stringency of 20; summarised in FIG. 1A) shows thatthe total DNA sequence homology between CELO virus and Ads is mapped inthe central region of the CELO virus genome. This is to be expectedbecause the capsid proteins are coded in this central region and thecoarse structure of the CELO virion is comparable with the capsid of themammalian adenovirus (Layer, et al., 1971; Li, et al., 1984a). The geneswhich code for proteins corresponding to the human adenovirus proteinshexon, IIIa, penton base, protein VI and protein VIII, are present, andindeed in the expected sequence and position (FIG. 1A and Table 2A;Table 2B shows non-associated open reading frames which code for geneproducts with more than 99 amino acid groups). Each vertex of themastadenovirus virion contains a pentamer of the penton base protein inconjunction with a single fibre consisting of three copies of the fibrepolypeptide. Ad2, like most mastadenoviruses, has a single fibre gene,some adenovirus types have two fibre genes. The CELO virus genome codesfor two fibre polypeptides of different lengths and sequences.

[0017] DNA binding proteins were identified in the region E2 (Li, etal., 1984c); four proteins with similar peptide maps were described,indicating a single precursor which is then cleaved or decomposed. Theleft hand open reading frame of the CELO virus genome, starting at nt23,224, is located in the expected DNA binding protein region. The genescoding for DNA polymerase and pTP (pre-terminal protein) are present andin the expected positions (FIG. 1A, Table 2A).

[0018] With a view to the preparation of vectors based on the CELO virusit is useful to identify the mechanisms which the CELO virus uses inorder to package almost 44 kb of DNA into a virion which is of a similarsize to the human adenoviruses which are subjected to considerablerestrictions on their packaging capacity (Bett, et al., 1983; Caravokyriand Leppard, 1995; Ghosh-Choudhury, 1987). One possibility is that theCELO virion, although virtually identical in size to Ad2 and Ad5, has asufficiently widened structure in order to accommodate the largergenome. An alternative hypothesis is that CELO has a different mechanismfor condensing DNA and therefore has differences in its provision ofcore proteins which are responsible for DNA packaging. Layer et al., in1971 identified two proteins in the nucleus of the CELO virus and notedthe absence of a molecule resembling protein V. Li, et al., 1984b, usedelectrophoresis with a higher resolution and reported a nuclearstructure with three polypeptides (20 kD, 12 kD and 9.5 kD). These twofindings lead one to conclude that the CELO virus must lack the largerbasic nuclear protein V (41 kD) which occurs in mammalian adenoviruses.Perhaps the absence of protein V and/or the presence of smaller basicproteins is responsible for the additional packaging capacity of theCELO virion. The smallest of the CELO virus core proteins identified byLi, et al., 1984b (9.5 kD) is most closely associated with the virusDNA, similar to protein VII of the human adenovirus. An open readingframe which leads one to expect a protein with 8,597 D having 72 aminoacids is located at nt 16,679; the coded protein is rich in arginine(32.9 mol%) and contains two cleavage sites for protease (pVII of Ad2has only one cleavage site). An open reading frame which leads one toexpect a protein with 19,777 D having 188 amino acid groups is locatedat nt 16,929. The protein has protease cleavage sites after groups 22,128 and 145, and the carboxy-terminal groups have homology with pX ofmastadenovirus. FIG. 4 shows the amino acid sequences of protein VII andpX of various mastadenoviruses compared with the CELO virus and the coreproteins Core 2 and Core 1 of FAV-10. The sequences were arranged usingthe UWGCG Bestfit Program with a gap weight of 3.0 and a weight and gaplength (“Gap Length Weight”) of 0.1. The protease cleavage sites ofadenovirus are underlined. In connection with this it is interestingthat the mastadenovirus DNA binding protein designated “mu” consistingof 19 groups is formed by two protease cleavings of the pX precursor(Hosokava and Sung, 1976; Weber and Anderson, 1988; Anderson, et al.,1989). Cleaving of the protein having 188 groups after groups 128 and145 would produce a mu-like basic protein consisting of 17 groups (41%arginine, 12% lysine). The uncleaved form of the protein is also highlybasic; the uncleaved copies of this protein could correspond to the 20kD core protein observed by Li, et al., 1984b; a third 12 kD coreprotein identified by these authors could not yet be assigned.

[0019] Moreover, some new or non-assigned open reading frames were foundin the CELO virus genome. A summary of these open reading frames isshown in Table 2A; these open reading frames are also given in FIG. 1A.This summary was restricted to the sequences from nt 0-6,000 and31,000-43,804 and only ORFs which contain a methionine group and codefor a protein >99 amino acid groups are mentioned. As already stated,there is an ORF at nt 1999 which codes for a protein having homologywith parvovirus—REP, and an ORF at nt 794 having homology with dUTPaseand Ad2 E4 ORF1. The objective of the present invention was to prepare anew CELO virus.

[0020] Thus, on the basis of the complete CELO virus genome sequence,the present invention relates to a CELO virus obtained by in vitromanipulation of a plasmid-cloned CELO virus DNA.

[0021] The CELO virus according to the invention derived from thegenomic DNA contains, in a preferred embodiment, the left and rightterminal repeat and the packaging signal and has modifications in theform of insertions and/or deletions and/or mutations in regions of theCELO virus DNA which are different therefrom.

[0022] The left or right terminal repeat (“Inverted Terminal Repeat”,ITR) extends from nucleotides 1-68 or from nucleotides 43734-43804, thepackaging signal (also referred to as “Psi”) extends from nucleotides70-200. Modifications-in DNA sections other than these ensures that thegenes affected by the modification are non-functional or are deleted.

[0023] Preferably, modifications of the CELO virus genome are undertakenwhich are located on a section of the CELO virus DNA which includes thenucleotides from about 201 to about 5,000 (following the left terminalrepeat of the section at the left hand end, hereinafter referred to as“Section A”) and/or on a section which includes the nucleotides fromabout 31,800-about 43,734 (the section at the right hand end located infront of the right terminal repeat, hereinafter referred to as “SectionB”) and/or on a section which includes the nucleotides from about28,114-30,495 (the region of the fibre 1 gene, hereinafter referred toas “Section C”).

[0024] A CELO virus in which certain genes are non-functional or aredeleted, e.g. genes which affect the immune response of the host, suchas antagonists to genes of the E3 region of mammalian adenoviruses, canbe used as a vaccine.

[0025] In one embodiment of the invention the CELO virus contains one ormore foreign DNA molecules, particularly a foreign DNA which is to beexpressed in a host organism. In this embodiment the CELO virus acts asa vector which is capable of transporting the foreign DNA into highereukaryotic cells, tissue or organisms, particularly mammals and birds,and expressing it therein.

[0026] Suitable insertion sites for the foreign DNA are the sections Aand/or B and/or C.

[0027] The foreign DNA preferably replaces one or more sequences fromthese sections.

[0028] The CELO virus according to the invention is contained on aplasmid which is replicatable in bacteria or yeast and which yieldsvirus particles after being introduced into suitable cells. Examples ofsuitable cells are bird embryo kidney or liver cells.

[0029] With a view to using a recombinant CELO virus vector for genetherapy, the foreign DNA may consist of one or more therapeuticallyactive genes. Examples of these are genes coding for immunomodulators ormodulators of inflammatory processes (cytokines such as IL-2, GM-CSF,IL-1, IL-6, IL-12; interferons, tumour antigens, IKB, and derivatives ofIKB which lack serine phosphorylation sites (Traenckner, et al., 1995)or which lack lysine ubiquitinisation sites; glucocorticoid receptors;enzymes such as catalase, manganese superoxide dismutase, glutathioneperoxidase, LIP members of the C/EBP family such as LIP or LAP(Descombes and Schibler, 1991), ADF (Tagaya, et al., 1989)), genes whichinfluence apoptosis (members of the Bcl-2 family such as Bcl-2,adenovirus E1B19K, Mcl-2; BAX; IRF-2; members of the ICE proteasefamily; variants of cJun, such as TAM-67 (Brown, et al., 1991);adenovirus E1A; p53) and genes which code for other therapeutic proteins(e.g. clotting factors such as factor VIII or IX; growth factors such aserythropoetin; cystic fibrosis transmembrane regulator gene (CFTR);dystrophin and derivatives thereof; globin; the LDL receptor; geneswhich are absent in lysosomal storage dysfunctions such asβ-glucuronidase; etc.).

[0030] With regard to the production of cellular tumour vaccines or forpharmaceutical compositions with which the immune response to tumours isto be intensified, the foreign DNA codes for immunostimulating proteinsor tumour antigens or fragments thereof.

[0031] The therapeutically active DNA may also code for antisensemolecules which prevent the expression of genes or the transcription ofspecific RNA sequences in the target cell.

[0032] With regard to the use of the recombinant CELO virus vector as avaccine, the foreign DNA codes for one or more antigens which bringabout an immune response in the individual treated.

[0033] In one embodiment of the invention the foreign DNA codes for anantigen derived from a human pathogen, particularly a pathogen ofinfectious diseases.

[0034] Epitopes which can be expressed by recombinant CELO virusesinclude epitopes derived from all kinds of human viral pathogens such asHIV, hepatitis A, B, C, hanta virus, polio virus, influenza virus,respiratory syncytial virus, measles, mumps, rubella, papilloma and manyother viruses. The non-viral pathogens include trypanosomes (the causalagents of sleeping sickness and Chagas' sickness), leishmania,Plasmodium falciparum (malaria), various bacterial pathogens such as thepathogens which cause tuberculosis, leprosy, Pseudomonas aeruginosa(complications in cystic fibrosis) and many others.

[0035] A summary of vaccines based on mastadenoviruses is given in Table3; the epitopes mentioned therein by way of example can also be used forinsertion in a vector based on the CELO virus.

[0036] With regard to the use of the recombinant CELO virus vector as avaccine in the veterinary field, e.g. for birds, particularly poultry,the foreign DNA, in another preferred embodiment, codes for an antigenderived from a protein of a pathogen of animal diseases, particularlyinfectious diseases in birds.

[0037] Examples of pathogens of bird diseases are Avian InfectiousBronchitis Virus (IBV, a corona virus; Jia, et al., 1995; Ignjatovic andMcWaters, 1991; Kusters, et al., 1990; Lenstra, et al., 1989; Cavanagh,et al., 1988; Cunningham, 1975), Avian Influenza Virus (OrthomyxovirusType A; Kodihalli, et al., 1994; Treanor, et al., 1991; Tripathy andSchnitzlein, 1991), Fowlpox-Virus (McMillen, et al., 1994), AvianInfectious Laryngotracheitis Virus (Guo, et al., 1994; Scholz, et al.,1993; Keeler, et al., 1991), Mycoplasma Gallisepticum (Nascimento, etal., 1993), Avian Pasteurella Multocida (Wilson, et al., 1993; Lee, etal., 1991; Hertman, et al., 1980; Hertman, et al., 1979), Avian Reovirus(Ni and Kemp, 1992; Huang, et al., 1987), Marek's Disease Virus (MDV;Malkinson, et al., 1992; Scott, et al., 1989), Herpes virus of turkeys(HVT, Herpes virus of Turkeys), Newcastle Disease Virus (NDV; Cosset, etal., 1991; Morrison, et al., 1990), Avian Paramyxovirus Type 1 (Jestin,et al., 1989), Avipoxvirus Isolates (Schnitzlein, et al., 1988), such asJuncopox, Pigeon Pox, and Field—(Field) and vaccine strains of bird poxviruses, Avian Encephalomyelitis Virus (Shafren and Tannock, 1991;Nicholas, et al., 1987; Deshmukh, et al., 1974), Avian Sarcoma Virus,Rotavirus (Estes and Graham, 1985), Avian Reovirus (Haffer, 1984;Gouvea, et al., 1983; Gouvea and Schnitzer, 1982), H7 Influenza Virus(Fynan, et al., 1993).

[0038] Apart from DNA sequences which code for therapeutically activegene products or for antigens, the foreign DNA may code for proteins orprotein fragments which change the behaviour of the CELO virus,particularly its ability to bind to cells, with regard to the use ofmammals, particularly on mammalian cells. Examples of such proteins arefibre or penton base proteins from mammalian and other adenoviruses,surface proteins of other viruses and ligands which have the ability tobind to mammalian cells and transport the CELO virus into the cells.Suitable ligands include transferrin from various mammalian species,lectins, antibodies or antibody fragments, etc. The skilled person willbe aware of ligands of this kind, other examples can be found in WO93/07283.

[0039] The recombinant CELO virus may contain one or more foreign DNAmolecules. These may be inserted either in tandem or spaced apart indifferent sections of the CELO virus sequence.

[0040] The foreign DNA is under the control of regulatory sequences;suitable promoters include for example the CMV immediate earlypromoter/enhancer, the Rous sarcoma virus LTR, the adenovirus major latepromoter and the CELO virus major late promoter.

[0041] The suitability of the CELO virus for preparing vectors and theadvantages of these vectors and their applications are basedparticularly on the following properties of the CELO virus:

[0042] i) Safety: Naturally, the CELO virus does not replicate inmammalian cells. Therefore, vectors based on this virus can be used inhumans without any danger of a subsequent infection with a Wild typehuman adenovirus complementing the vector and possibly allowingreplication. This is an advantage over the Ad2 and Ads vectors used atpresent.

[0043] ii) Increased packaging capacity: The CELO virus genome is about44 kb long, compared with the 36 kb of the Ad5 genome. Both viruses havecomparable virion sizes, so that with a CELO virus vector it is possibleto expand the strict packaging limit of 35 kb which is available withAds. On the basis of the sequencing carried out within the scope of thepresent invention, DNA-packaging core proteins of the CELO virus wereidentified, and striking differences were found from Ad2 which could beresponsible for the increased packaging capacity. There are about 13 kbat both ends of the CELO virus which would appear not to code forstructural proteins (e.g. capsid components) or for proteins requireddirectly for the virus replication (e.g. DNA polymerase). For theproduction of vectors, these sequences on the CELO virus genome can beremoved and if necessary replaced by complementing cell lines. Thesesequences can be assumed to code for the immune functions or theapoptotic functions (e.g. GAM-1) of the host cell or to be involved inthe activation of the host cell for virus replication (antagonists tothe E1, E3 and E4 regions of Ad2). These are the gene types which areeither non-essential for virus growth in the cell culture, such as theE3 genes of Ad2 (Wold and Gooding, 1991; Gooding, 1992), or which areeasily removed from the virus and can be expressed by a complementingcell line, such as the E1 region in 293 cells (Graham, et al., 1977) andthe E4 region in W162 cells (Weinberg and Ketner, 1983).

[0044] iii) Stability: The CELO virion is remarkably stable. Itsinfectivity and its ability to transport DNA withstand a 30 minutetreatment at 60° C. As a comparison, Ad5 loses two powers of ten of itsinfectivity at 48° C. and is completely inactivated at 52° C.Presumably, the CELO virus did not develop its heat stability naturally,rather this heat stability would indicate a reaction to another type ofselective pressure on the virion. The natural route of CELO virusinfection is a faecal-oral route, which requires the virion to survivecontact with a chemically aggressive environment with extreme pH valuesand with proteases. For particular applications in gene therapy, a moreresistant virus would be desirable which would survive, for example, inthe digestive tract or in the lungs of a patient suffering from cysticfibrosis.

[0045] iv) Targeted use: The CELO virus binds only slightly to mammaliancells on its own and for efficient entry into the cell requires theaddition of a ligand (transferrin or lectin; Cotten, et al., 1993).Therefore, recombinant CELO virions are unable to penetrate into humancells, resulting in the following possible applications:

[0046] The virus may be genetically modified as stated above in order toexpress, on its surface, ligands which enable targeted transportation,such as for example specific peptides or fibres and/or penton bases ofhuman adenoviruses.

[0047] Another possibility is the chemical modification of the virus inorder to couple specific ligands such as transferrin thereto, asproposed for example in WO 94/24299 and additionally the virus may bebiotinylated (W93/07283) and bound via streptavidin to biotinylatedligands such as wheatgerm agglutinin or other lectins (WO 93/07283).CELO virus vectors thus do not have the disadvantages of humanadenoviruses which have good binding ability to human cells but have tobe masked for a specific, targeted use mediated by the ligand.

[0048] v) Possibility of use for vaccines: The CELO virus is seldomconnected to diseases in birds, indicating that it provokes a strongprotective immune response in bird hosts. The CELO virus vector caneasily be adapted for the expression of new vaccine epitopes.

[0049] With regard to the removal of regions of the CELO virus DNA, itwas concluded from the results obtained with the mastadenovirus that,when the central sections of the CELO genome are removed, these sectionshave to be made available in trans, e.g. by a packaging cell line. Thisrestriction is based on the large quantity of virion components whichare necessary for the assembly of the virus and the need to produce acell line which is capable of producing the corresponding amounts ofthese proteins without any toxicity.

[0050] The approaches which have hitherto proved more successful in thisrespect with mastadenoviruses consisted of the production of cell lineswhich express regulatory proteins (E1 and E4 regions) or enzymaticproteins (DNA polymerase, DNA binding protein), because these proteinsare not required in large amounts during a productive virus infection.

[0051] Starting from the analysis of the known CELO genes, the sectionsof the genome from nt about 12,000 to about 33,000 which code forstructural components of the virus are preferably not interrupted. Theregion from about nt 5,000 to about 12,000 codes for the E2 genes IVa2(a viral transcription factor), viral DNA polymerase (POL) and theterminal virus protein (viral terminal protein; pTP). These genes areessential for the function of mastadenoviruses; they ought therefore tobe essential for the CELO virus as well. However, it is theoreticallyalso possible to have deletions of essential genes of this kind providedthat they can be produced in trans, e.g. by a packaging line. Forexample, a packaging cell line was prepared which produces Ad5 DNApolymerase, thus making it possible for this gene to be deleted from thevirus genome. A similar procedure may be used in the construction ofCELO virus vectors, by removing sections or the entire region from nt5,000 to 12,000 from the CELO virus and having the correspondingfunctions controlled in trans by a packaging cell line.

[0052] Another possible restriction exists with regard to the presumedmajor late promoter, which was provisionally assigned to the region atabout nt 7,000 (TATA box at nt 7,488). In the mastadenoviruses, thispromoter is essential for driving late gene expression. Therefore, anychange in the region at nt 7,000 of the CELO genome must be carried outin a way which maintains the promoter function of this region.

[0053] Table 4 lists the sequence elements of the CELO virus genome andis divided into various categories with regard to their deletion and/ormutation in the production of CELO virus vectors (in Table 4 L1, L2,etc. denote “late message 1, 2, etc., corresponding to the nomenclaturenormally used for mastadenoviruses):

[0054] Category 1 includes sequence elements which are required in cisand therefore cannot be made available in trans by a complementing cellline or by a complementing plasmid. These sections are necessary and aretherefore present on the CELO virus according to the invention; theseare the left and right terminal repeats and the packaging signal.

[0055] Sequences of category 2 code for proteins which are required inlarge amounts for virion production. These proteins may optionally beproduced by a gene contained in a complementing cell line or on acomplementing plasmid. Other sequences of category 2 are the major laterpromoter, the tripartite leader sequence and also the splice acceptorsites (SA) or the polyadenylation sites (poly A sites) of genes whichare essential and which cannot be made available in trans.

[0056] Fundamentally, care must be taken to ensure that, inmodifications of the CELO virus DNA which are carried out at theboundaries of genes, any control signals present, e.g. polyA sites arenot interrupted or affected in any other way if possible.

[0057] The genes deleted from the CELO virus or non-functional genestherein may be prepared in trans, e.g. by complementing cell lines.

[0058] Complementing cell lines (“helper cells”) can be prepared by themanner known from the literature, analogously to helper cells whichcomplement the functions of mammalian adenoviruses. To do this, therelevant CELO virus gene on a plasmid, preferably combined with aselectable marker, is introduced into cells which permit the replicationof CELO virus, preferably into immortalised cell lines such as LMH(Kawaguchi, et al., 1987) or immortalised quail cell lines, as describedfor example by Guilhot et al., 1993. In helper cells which express therelevant CELO virus genes, optionally stably integrated, the defectiveCELO viruses are able to replicate.

[0059] Instead of making the regions of the CELO virus deleted in thevector available by means of a cell line, the deletions may also becomplemented by a copy of the relevant gene contained on a plasmid. Forthis, the method described in WO 96/03517, that described by Cotten etal., 1994a and 1994 b) or the one described by Wagner et al., 1992, maybe used, for example, in which a CELO virus vector containing a deletionis inserted, as a component of a transfection complex, containing aconjugate of polylysine and a UV/psoralen-inactivated adenovirus (humanor CELO) and optionally transferrin-polylysine, into embryonic chickenkidney cells or liver cells, embryonic or immortalised quail cells, e.g.liver or kidney cells, and the transfection complex also contains aplasmid which carries a copy of the gene or genes which lack the CELOvirus vector. The combination of genes contained on the vector and genescarried by the plasmid results in a normal virus replication cycle.(Similar approaches were used in mastadenovirus systems in order tocomplement E1-deficient adenoviruses (Goldsmith, et al., 1994) andE4-deficient adenoviruses (Scaria, et al., 1995)). The subsequentamplification of the virus may be carried out by using the defectivevirus as a carrier which is dependent on the complementing plasmid usingthe methods described above.

[0060] Another possible way of replacing the genes missing from the CELOvirus is by using helper viruses.

[0061] The helper virus used may be a CELO virus (Wild type or partlydefective). In this embodiment the CELO plasmid carrying the mutation(e.g. a derivative of pCELO7), is introduced into chicken cells, e.g.using the method described in WO 96/03517 or described by Cotten et al.,1994, using as the carrier for the derivative, for example,psoralen/UV-inactivated adenovirus (human or CELO) together with anadenovirus (human or CELO) as a carrier for the plasmid or plasmids withthe genes which complement the defect. Alternatively, a Wild type CELOvirus may be used both as a carrier and as source for complementing genefunctions. The subsequent amplification of the defective CELO virusesobtained is carried out by co-infection of the defective CELO virus witha complementing adenovirus (e.g. Wild-type CELO or a CELO which hasmutations at other points of the genome).

[0062] CELO virus genes of category 3 include the sequences on SectionsA, B and C. These are sequences which code for a protein or an RNAmolecule which is necessary for the interaction with the host cellmachinery or with the host immune system. These proteins should berequired in fairly low concentrations or may be dispensable forcultivation of the virus in the tissue culture.

[0063] Thus, preferably, in the CELO virus vectors according to theinvention the genes of category 3 are replaced by the gene in question;if necessary, complementary cell lines or plasmids or helper viruses canbe prepared which produce the corresponding gene products.

[0064] In one embodiment of the invention the vectors according to theinvention contain the gene in question instead of one of the fibregenes. The CELO virus has two fibre proteins (Layer, et al., 1971;Gelderblom and Maichle-Lauppe, 1982; Li, et al., 1984a). It can beassumed that one of the fibres of the CELO virus is not necessary forthe assembly of the virion and the infectivity. This assumption isbacked up by electron microscopic observations that the longer fibre(fibre 1) should associate with the penton base along the side of thecomplex, whilst the shorter fibre (fibre 2) projects out of the middleof the penton base, similarly to the penton/fibre complexes in themastadenoviruses (Hess, et al., 1995). In adenoviruses with only asingle fibre, the fibre molecule is required for the assembly of thevirus; in the absence of fibres no stable mature viruses are formed. TheCELO virion should therefore require fibre 2 for stability and as aligand, whereas fibre 1 acts only as a ligand. Within the scope of thepresent invention the assumption that, of the two fibre genes of theCELO virus, fibre gene 1 located in region C is superfluous and can bereplaced by the gene in question was proved correct by removing thefibre 1 gene and replacing it with a luciferase expression unit.

[0065] Other examples are inserts in region A and/or B.

[0066] Within the scope of the present invention it was found thatdestruction of the reading frame at nt 794 (region A) which codes fordUTPase yields viable viruses. The dUTPase gene is thus a gene which isnot necessary for growth in cell culture.

[0067] In one embodiment of the invention, the recombinant CELO virusthus contains a foreign gene which is inserted in the region of thereading frame coding for dUTPase.

[0068] According to another aspect the present invention relates to aprocess for preparing recombinant CELO virus.

[0069] The process is characterised in that the CELO virus genomecontained on a plasmid or sections thereof is or are geneticallymanipulated.

[0070] In one aspect of the invention the genetic manipulation consistsof insertion and/or deletion. Insertions and/or deletions can be carriedout by using restriction enzyme cutting sites which occur naturally inthe CELO virus DNA in these sections, e.g. the FseI cutting siteoccurring at position 35,693 in Section B. The insertion may be carriedout directly into this cutting site or beyond this cutting site or closeto this cutting site, or this cutting site may be used to allowrecombination in the surrounding area.

[0071] In one preferred embodiment the manipulation consists of carryingout insertions and/or deletions using standard methods of molecularbiology (Maniatis, 1989). The naturally occurring restriction cuttingsites can be used for this, e.g. sites located in regions of the genomewhich are non-essential for cultivation of the virus in the host cell,e.g. the FseI cutting site which occurs at position 35,693 in Section B.The insertion can be made directly into this cutting site or beyond thiscutting site or in the vicinity of this cutting site, or the cuttingsite may be used to facilitate recombination in the surrounding area.Foreign DNA sequences can be inserted, e.g. marker genes or genes codingfor therapeutically active proteins.

[0072] An alternative possibility is to remove CELO sequences which areflanked by two restriction sites and replace them with new sequences.(An example of this is dUTPase mutation as carried out in Example 7.) Inthese cases the manipulation is carried out with the entire CELO virusgenome. In the event that restriction sites are present, thedeletion/insertion can alternatively be carried out on a subfragmentwhich is then re-incorporated in the entire genome by ligation andoptionally recloning in bacteria.

[0073] Another possibility is to insert the foreign gene in artificialrestriction enzyme cutting sites produced by conventional methods ofrecombinant DNA technology (Maniatis, 1989).

[0074] In one embodiment the process is characterised in thatmanipulations are carried out in a plasmid DNA which contains the CELOvirus genome, in CELO DNA sequences, with the exception of the left andright inverted terminal repeats and the packaging signal.

[0075] In another preferred method, the manipulation of the CELO genomeis carried out by recombination. For this, a subfragment of the CELOgenome is manipulated in order to introduce mutations and/or newsequences. Subfragments can be prepared by various methods, specificallyby PCR (polymerase chain reaction), by ligation between PCR products orbetween restriction fragments or by subcloning in bacteria (as describedin the Examples of the previous invention; see also Chartier et al.,1996). Examples of suitable bacteria strains for recombination are BJ5183 (Hanahan, 1983) or JC 8679 (Gillen et al., 1974) or JC 5176(Capado-Kimball and Barbour, 1971).

[0076] For recombination using PCR products, the sequence to be insertedinto the CELO genome is prepared by PCR (Oliner et al., 1993) usingprimers which flank the sequence plus about 15 nucleotides of thesequence complementary to the insertion site in the CELO genome. In asecond round of PCR, another 15 nucleotides are hung from the sequencecomplementary to CELO, resulting in a PCR product which consists of thesequence to be inserted with 30 nucleotides of the CELO sequence at eachend. This fragment is mixed with a plasmid which contains the CELO DNA(e.g. the plasmid pCELO7 prepared within the scope of the presentinvention) and which has been linearised with a restriction enzyme whichcuts only between the two flanking sequences hung from the sequence byPCR.

[0077] For recombination using ligation reaction products (prepared byconventional techniques as described for example by Maniatis et al.,1989), in principle the same procedure is used as in the recombinationwith cloned fragments, except that the intermediate cloning step isomitted.

[0078] In every case, the manipulated product obtained is characterisedand used to prepare virus by transfecting avian cells (e.g. using themethod described by Wagner et al., 1992; Cotten et al., 1994; or Cottenet al., 1993) and then cultivating them, after which the virus isharvested.

[0079] For preparing recombinant CELO virus using cloned fragments, themethod preferably comprises subcloning a small fragment from therelevant region of CELO virus into which the foreign gene is to beinserted on a bacterial plasmid in order to ensure that restrictionsites which occur several times on the CELO virus genome occur only onceon the plasmid. These restriction sites are used to remove a region fromthe small fragment. For preparing the CELO virus vector this region isreplaced by foreign DNA. The foreign DNA may consist solely of a linkerwith a restriction site occurring only once or of a sequence coding fora protein or for an antigen. The sequence may also code for a reportergene with a restriction site occurring only once. This makes furthermanipulation of the CELO virus easier by inserting the foreign DNA,which codes for a therapeutically active gene product or for an antigen,into this restriction site, and at the same time the reporter genepermits rapid information as to the efficiency of the vector, byintroducing the plasmid into cells and monitoring the expression of thereporter gene.

SUMMARY OF THE FIGURES

[0080]FIG. 1A: Comparison of the genomic organisation of Ad2/5 with theCELO virus

[0081]FIG. 1B: Restriction map of the CELO virus genome

[0082] FIGS. 2A and B: Pulsed Field gel-electrophoretic analysis of thegenome size of adenoviruses

[0083]FIG. 2C: Characterisation of plasmid-cloned copies of the CELOvirus genome by means of restriction endonucleases

[0084]FIG. 3: Dot matrix analysis of the DNA sequence homology betweenCELO virus and Ad2

[0085]FIG. 4: Amino acid sequences of protein VII and pX from variousmastadenoviruses compared with CELO virus and the core proteins core 2and core 1

[0086]FIG. 5: Construction of a plasmid which contains the entire lengthof the CELO genome

[0087]FIG. 6: Preparation of a CELO vector from a copy of the CELO virusgenome contained on a plasmid

[0088]FIG. 7: Identification of bacterial clones which contain adeletion in the dUTPase gene

[0089]FIG. 8: Comparison of Wild-type CELO and CELO containing adeletion in the dUTPase gene by Western blot analysis.

[0090] In the Examples, the following materials and methods were usedunless otherwise specified:

[0091] a) Virus and Virus DNA

[0092] A plate-purified isolate from CELO virus (FAV-1, Phelps strain)which was used as starting material for the DNA both for directsequencing and for the formation of bacterial plasmid clones, was grownin 9 day old pathogen-free chicken embryos, as described by Cotten, etal., 1993. The FAV-1 isolates OTE (Kawamura, et al., 1963) and Indiana C(Calnek and Cowen, 1975; Cowen, et al., 1978) were cultivated in chickenembryo kidney cells. The virus was purified from the allantoic fluid orfrom infected embryo kidney cells by separation in CsCl gradients, asdescribed by Layer, et al., 1971, and Cotten, et al., 1993. Virus DNAwas isolated by treating the purified virions with proteinase K (0.1mg/ml) and SDS (0.2%) at 56° C. for 45 min and subsequent equilibriumcentrifugation of the DNA in a CsCl gradient in the presence of ethidiumbromide. After the second gradient, the ethidium bromide was removed byextraction with CsCl-saturated isopropanol and the virus DNA wasexhaustively dialysed against 10 mM Tris, 0.1 mM EDTA, pH 8.

[0093] b) Chicken Embryo Kidney Cells

[0094] The kidneys of 14 day old chicken embryos were collected, washedin PBS and digested with pancreas trypsin (2.5 mg/ml in PBS) at 37° C.The dispersed cells were mixed with an equal volume of foetal calfserum, the cells were collected by centrifugation, washed once with FCKmedium and taken up in the same medium again. (The FCK medium is medium199 with Earle's salts (Sigma M2154) supplemented with 10% tryptosephosphate (Sigma T8159), with 10% foetal calf serum, 2 mM glutamine, 100μg/ml streptomycin and 100 IU/ml penicillin.) The cells were plated outin 175 cm² tissue culture flasks (2 embryo kidneys per flask), stored at37° C. under 5% CO₂ and infected 24 to 48 hours later. The cells wereinfected with about 1,000 virus particles per cell and harvested 3 to 4days after infection when the cytopathic effect was complete.

[0095] c) Pulsed Field Electrophoresis

[0096] Aliquots of purified adenovirus DNA (10-20 ng) were loaded onto a1% agarose gel (BioRad, PFC agarose) and separated using a BioRad CHEFMapper Pulsed Field Electrophoresis system (FIGE mode) for 24 hours in0.5×TBE cooled to 14° C. The switching time, both in the forwarddirection and in reverse, was changed logarithmically from 0.22 secondsto 0.92 seconds with a ramp factor of 0.357 (21%). The forward voltagegradient was 9 V/cm (300 V), the reverse voltage gradient was 6 V/cm(200 V). After the run the gel was stained for 25 minutes in 0.5 μg/mlof ethidium bromide solution in water and then destained for 1 hourbefore the DNA pattern was made visible under UV light.

[0097] d) Sequencing Methods, Data Analysis

[0098] For the sequencing, EcoRI and HindIII restriction fragments ofCELO virus DNA were cloned in pBlueScript SK(−). Three of the EcoRIclones containing the EcoRI fragments C, D and E (see FIG. 1B) and fiveof the HindIII clones containing the HindIII fragments F, A, G, B and E(cf. FIG. 1B, were selected for the preparation of deletions in onedirection using exonuclease III (in FIG. 1B the cleavage sites for therestriction enzymes EcoRI, HindIII, BamHI and BglII are given; thealphabetical names for the EcoRI and HindIII fragments, on the basis oftheir relative sizes, are also given). These deletion clones weresequenced using the Taq Dyedeoxy Terminator System with the automaticsequencing apparatus ABI 373 according to the manufacturer'sinstructions. The sequence analysis of the terminal 2,000 bp at the lefthand end and the 1,000 bp at the right hand end of the CELO virusgenome, the sequencing to close the gaps between the fragments EcoRIC/HindIII G and the fragments HindIII B/EcoRI D and the sequencing toconfirm the sequence at various points of the genome were carried out bydirect sequencing of the viral DNA. All the sequence data are theresults of at least three sequence reactions. The sequence data werecombined using the programs SeqEd (ABI) and SeqMan (Lasergene). Thesequence analysis was carried out using the program GCG of theUniversity of Wisconsin.

EXAMPLE 1

[0099] Preparation of a Recombinant Bacterial Plasmid Clone of the CELOVirus Genome

[0100] a) Preparation of a Plasmid Vector with a Low Copy Number forCloning the CELO Virus

[0101] The bacterial vector pBR327 (ATCC No. 37516) was chosen for thisbecause it is retained in bacterial host strains at relatively low copynumbers (instead of this plasmid any other plasmid with a low copynumber such as pBR322 could be used equally well). It was essential tocreate on the vector a restriction site which occurs only once and whichdoes not appear in the CELO virus sequence. As described hereinafter,the virus sequence has to be cut from the plasmid vector sequences inorder to inject a productive infection; therefore, restriction siteswhich flank the CELO sequence (but which are not present within the CELOsequence) have to be incorporated in the vector. In the experimentscarried out, the restriction enzyme SpeI was used; however, any otherenzymes which do not have recognition sites in the CELO sequence, suchas AscI, PacI and SfiI, may be used instead.

[0102] The plasmid p327SpeI was prepared by ligating an SpeI linker (NewEngland Biolabs) into the Klenow-treated EcoRI site of pBR327, therebydestroying the EcoRI site and creating an SpeI site which occurs onlyonce.

[0103] b) Cloning the Ends of CELO

[0104] The two terminal HindIII fragments were cloned. In order to dothis, CsCl-purified genomic CELO DNA was digested with HindIII andseparated on a low-melting agarose gel (0.7% low melting agarose inTAE). The 1601 bp left hand end fragment and the 959 bp right hand endfragment were cut from the gel, and each gel fragment was suspended in300 μl of 10 mM Tris, 1 mM EDTA pH 7.4 and heated to 70° C. for 10 minto melt the agarose. The terminal peptides were eliminated by theaddition of NaOH to 0.3 N and heating to 37° C. for 90 min. (Hay, etal., 1984). The solutions were then cooled to ambient temperature, thenTris pH 7.4 (to 0.1 M) and HCl (to 0.3 M) were added in order toneutralise the NaOH. The fragments were heated to 56° C. for 20 min. andslowly cooled (over 1 hour) to ambient temperature in order tofacilitate re-annealing. Then the DNA was purified over a Qiaquickcolumn and ligated for 4 hours at 16° C. using a Pharmacia T4 ligasereaction (New England Biolabs) to an SpeI linker (New England Biolabs).The ligase was inactivated by heating to 70° C. for 10 minutes, excesslinker was removed (and an overhang complementary to SpeI was formed) bydigesting for 2 hours with restriction endonuclease SpeI. The DNAfragments were in turn purified by Quiquick column chromatography andligated to p327SpeI treated with SpeI/HindIII/calf alkaline phosphatase.The ligation product was transformed into the bacterial strain DH5alpha,and plasmid clones were identified which carried either the 1601 bp lefthand end fragment or the 959 bp right hand end fragment (both releasedby SpeI/HindIII digestion). In order to confirm the terminal 300 bp ofboth fragments, DNA sequence analysis was carried out.

[0105] c) Cloning of both CELO Ends on the same Plasmid

[0106] The 1601 bp left hand end and the 959 bp right hand end fragmentwere cut from their vectors by HindIII/SpeI digestion, separated by gelelectrophoresis and purified by Qiaquick chromatography. The twofragments were mixed in approximately equimolar amounts and ligated for30 minutes using the Pharmacia T4 ligase reaction. An aliquot ofSpeI/CIP-treated p327SpeI was added and ligation was continued for 4hours. The ligation mixture was transformed in DH5alpha and plasmidclones were identified which carried the correct double insert (pwu#1and pWu#3).

[0107] The second HindIII site was removed by cleaving pWu#3 with ClaIand BamHI, treating with Klenow enzyme, religation, transformingDH5alpha and selecting a clone which

[0108] the ClaI/BamHI (which had contained a HindIII site). Theresulting plasmid designated pWu-H35 now contained a single HindIII sitebetween the left and right hand CELO end fragments.

[0109] d) Cloning the Entire CELO Genome

[0110] The plasmid pWu-H35 obtained in c) was treated with HindIII andCIP and purified on a low melting agarose gel following by Qiaquickchromatography. The linearised vector pWu-H35 was mixed with 0.3 μg ofpurified CELO virus DNA, then 30 μl of electro-competent bacterialstrain JC8679 (Gillen, et al., 1974; Oliner, et al., 1993) were added tothe DNA mixture on ice. 10 Minutes later the mixture was transferredinto a BioRad Electroporation chamber and pulsed with an electric chargeof 2.4 kV (BioRad Gene Pulser; Oliner, et al., 1993). The bacteria werethen plated onto LB ampicillin plates and the ampicillin-resistantcolonies were investigated for their plasmid content. Recombinationbetween the terminal CELO sequences on pWu-H35 and the ends of thegenomic CELO DNA re-establishes the circularity of the linearisedplasmid and allows growth on ampicillin. A plasmid which contains theCELO genome over its full length was identified, and this plasmid,referred to as pCEL07, was used for the subsequent investigations. FIG.2C shows the characterisation of plasmid-cloned copies of the CELO virusgenome. Plasmid DNA from clones designated pCEL07, 8, 9 and 13 or DNAisolated from purified CELO virus, was digested either with BglII(tracks 2-6) or HindIII (tracks 7-11) and separated on a 0.6% agarosegel, and the DNA was shown up by ethidium bromide staining. Themolecular weight marker (tracks 1 and 12) was bacteriophage λ-DNA cutwith HindIII and EcoRI. The sizes of some molecular weight fragments (inbase pairs) are shown on the right of the Figure. For each enzyme, thetwo CELO end fragments which are bound to the bacterial plasmid duringcloning (and which are therefore not released after restrictiondigestion) are given on the left of the Figure (in base pairs). Theseare the fragments with 5832 and 5102 bp with BglII or 1601 and 959 withHindIII.

[0111] The construction of pCEL07 is shown in FIG. 5.

[0112] e) Initiation of a CELO Virus Infection by a Cloned CELO Genome

[0113] pCELO7 was cleaved with SpeI (which cleaves at the sites flankingthe adenovirus termini), extracted with phenol/chloroform and passedover an HBS-equilibrated gel filtration column (Pharmacia Nick column)to remove any impurities. The cleaved DNA was then incorporated instreptavidin-polylysine/transferrin-polylysine/biotin-adenovirus(UV/psoralen-inactivated) as described in WO 93/07283. Complexescontaining 0.5 μg of SpeI-cleaved pCEL07 plus 5.5 μg of carrier DNA(pSP65; Boehringer Mannheim) were used to transfect primary embryonicchicken kidney cells (the complexes contain 4 mg of DNA per 180 cm²flask, containing about 3×10⁶ cells), and the cells were investigatedfor the cytopathic effect caused by virus replication. Five days aftertransfection, when the majority of the transfected cells had beenrounded off and detached from the surface of the plate, the cells wereobtained by centrifuging and the CELO virus was purified as described byCotten, et al., 1993. The virus yield from plasmid cloned CELO virus iscomparable with the yields obtained by using pure CELO virus DNA(purified from virions).

EXAMPLE 2

[0114] Preparation of a CELO Mutant which Lacks the Sequences from nt35,870 to 42,373 at the Right Hand End

[0115] There are no identifiable viral structural genes beyond the twofibre genes with the L5-polyadenylation site at position 31771. (Thereis a cryptic VA-gene at positions 39,841 to 39,751.) Investigations weretherefore carried out to see whether the sequences between about 32,000and the right ITR are necessary for the growth of the virus in cellculture. For this an accumulation of seven AseI sites were used atpositions 35,870, 36,173, 38,685, 38,692, 39,015, 42,348 and 42,373,which does not appear anywhere else in the CELO virus genome. pCELO7 wasdigested with AseI, religated and a plasmid which lacked the inner AseIfragments was identified and designated pALMCELO_(—)35870-42373. Inconnection with this it should be noted that the plasmid vector also hasan AseI site; however, this is located in the ampicillin resistancegene, and selection for ampicillin resistance requires that all positivecolonies have at least the two fragments which carry the right and lefthand halves of the amp gene.

[0116] To aid further manipulations of the virus with the missing lefthand end of the genome, pALMCELO_(—)35870-42373 was digested with AseI(which cuts once in the ampicillin resistance gene of the plasmid andonce at position 35,870) and ligated to a linker oligonucleotideTACCCTTAATTAAGGG which codes for a cutting site for the restrictionendonuclease PacI and for ends which are complementary to those formedduring AseI digestion. Religation, followed by selection for ampicillinresistance, identified plasmids which did not integrate theoligonucleotide at the AseI site of the ampicillin resistance gene.Restriction digestion identified a plasmid which carried a PacI site atthe earlier AseI site of CELO at position 35,870. The plasmid wasdesignated pALMCELO_(—)35870-42373P.

EXAMPLE 3

[0117] Preparation of a CELO Virus Vector in which a Fibre Gene isMissing which is Replaced by a Gene of Interest

[0118] The CELO fibre genes are contained on a HindIII fragment whichextends from nt 27,060 to 33,920 (the HindIII B-fragment, cf. therestriction map in FIG. 1B). On this fragment the sequence coding forfibre 1 extends from nt 1,054 to 3,435. The 5H3 fragment was digestedwith BglII (which cuts at nt 1,168) and HpaI (which cuts at 3,440), theBglII end was filled with Klenow enzyme and ligated to a bluntCMV/luciferase/β-globin cleavage/polyadenylation signal fragment fromthe plasmid pCMVL (Plank et al., 1992) to form the plasmidp5H_(—)28227-30502(luc) which lacks almost the entire fibre 1 sequencewhich is replaced by a luciferase expression unit.

[0119] The relevant restriction cutting sites in CELO are as follows:BglII A′GATC_T Cuts at: 0 5102 15979 23472 28227 37972 43804 Size 5102 10877  7493  4755  9745  5832 HindIII A′AGCT_T Cuts at: 0 16015626 17881 23327 27060 33920 38738 42845 42845 43804 Size: 1601 4025 12255 5446  3733  6860  4818  4107  959 HpaI GTT′AAC Cuts at:0 5503 20673 23355 30502 43804 Size:  5503  15170  2682  7147  13302NotI GC′GGCC_GC Cuts at: 0 17389 43804 Size:  17389  26415 XbaI T′CTAG_ACuts at: 0 1659 1988 28608 39268 41746 43804 Size: 1659  329  26620  10660  2478  2058

[0120] The modifications which were carried out onp5H_(—)28227-30502(luc) were introduced into the entire CELO genome inthe following manner: The CELO/luciferase/CELO fragment was cut fromp5H_(—)28227-30502(luc) as a HindIII fragment. This fragment wasrecombined with the 26kb XbaI fragment (CELO nucleotides 1988-28608) andthe terminal HpaI fragments derived from pCELO7 (obtained by cuttingwith HpaI, containing the left hand end of the CELO virus and pBR327sequences, defined by the HpaI sites). The three DNA fragments (eachabout 50 ng) were mixed in water and electroporated in JC8679 cells asdescribed above.

EXAMPLE 4

[0121] Insertion of a Reporter Gene (Luciferase) in the CELO Genome

[0122] i) Preparation of a Left Hand End Fragment Containing a CMVLuciferase Construct

[0123] The EcoRI fragment designated 7R1 (nucleotides from positions 79to 8877) was cloned into a pSP65 derivative designated pAAALM (describedin WO 95/33062). The plasmid was transformed into the DAM methylasenegative bacterial strain JM110 in order to allow cleavage of the ClaIsites in the fragment. The plasmid was purified, cut with ClaI (atposition 1083) and NcoI (at position 4334), treated with Klenow enzymeto fill the overhanging ends and ligated to a bluntCMV/luciferase/β-globin cleavage/polyadenylation signal (Plank et al.,1992). The resulting plasmid was designated p7R1_(—)1083-4334Luc.

[0124] ii) Recombination of the Luciferase Left Hand End Fragment into aComplete (Full Length) CELO Sequence

[0125] The plasmid p7R1_(—)1083-4334Luc was cleaved with Eco47 III,which cleaves at the CELO nucleotides 937, 1292, 2300 and 8406 (thesites at nucleotide 1292 and 2300 are absent from p7R1_(—)1083-4334Luc)in order to release a large fragment containing the sequenceCELOnt937-1083/CMVLusPA/CELOnt4334-8406.

[0126] This fragment was recombined in pCELO7. pCELO7 was cleaved at thesingle PmeI site at CELO nt7433 and exhaustively dephosphorylated withcalf intestinal phosphatase. The linearised pCEL07 was mixed with anapproximately 3 to 5 molar excess ofCELOnt937-1083/CMVLucPA/CELOnt4334-8406. The mixture was electroporatedinto the bacterial strain JC8679 and ampicillin-resistant colonies wereexamined on plasmids which contain the desired recombinant DNA. Thecorrect plasmid was identified, characterised by restriction enzymeanalysis and designated pCELOLucI.

[0127] iii) A CELO Virus Expressing Luciferase was Prepared byTransfecting pCELOLucI into Primary Embryonic Chicken Kidney Cells asDescribed above.

EXAMPLE 5

[0128] Preparation of a CELO Vector from a Copy of the CELO Virus GenomeContained on a Plasmid

[0129] The region between the DraIII site (originally contained at nt34,426 in the CELO virus genome) and the XhoI site (originally containedat nt 36,648 in CELO virus genome) were removed from the plasmid pAALMH3which contains the HindIII fragment from nt 33,920 to nt 38,738, clonedin pAALM. Then it was treated with T4-DNA polymerase to produce bluntends and ligated with the CMV/luciferase/β-globin fragment (cf. Example4). In this way the plasmid p7H3Δ34426-36648 Luc was obtained. TheCELO/luciferase/CELO fragment was cut on a HindIII fragment and insertedinto the CELO genome of pCELO7 by recombination via the FseI siteoccurring only once at position 35,694. This yielded the plasmidPCELOΔ34426-36648Luc. Digestion with SpeI and transfection intoembryonic chicken kidney cells yielded a virus CELOΔ34426-36648Luc. Thenfurther insertions were carried out replacing the luciferase sequencewith other genes of interest, using the once occurring PacI site whichwas introduced with the luciferase sequence.

[0130]FIG. 6 shows the cloning strategy used in this Example in generalform: a small CELO fragment is subcloned into a plasmid (containingrestriction site C); the restriction sites A and B which occur only oncein this plasmid are used to replace the sequence with foreign DNA. Asthe next step, the entire fragment containing the foreign DNA betweenCELO sequence is cut from the plasmid and mixed with the plasmid whichcontains the entire CELO DNA and which has been cut with a restrictionenzyme (D) which cleaves the CELO-DNA only once. With this mixture,bacteria (e.g. of the strain JC8679; Oliner et al., 1993; or anotherbacterial strain with a similar capacity for recombination) aretransformed; recombination yields the desired plasmid containing theforeign DNA as an insert in the CELO virus genome.

EXAMPLE 6

[0131] Preparation of a Quail Cell Line which Complements the 7R1Deletions and/or the 9R1 Deletions in CELO

[0132] The plasmids pX7R1 and pX9R1 (described in WO 95/33062) wereintroduced into primary embryonic quail kidney or liver cells bytransferrinfection as described in WO 93/07283. Four days aftertransfection the cells were trypsinised and seeded at ⅕ of the originaldensity. The cells were fed twice a week with FCK medium. Clonal lineswere expanded and clones which carried either the 7R1, 9R1 plasmid orboth plasmids were identified by PCR analysis. The RNA expression of theintegrated plasmids was determined by Northern analysis.

EXAMPLE 7

[0133] a) Preparation of a CELO Virus Genome with a Mutation in theORF₇₉₄ DUTPase Gene

[0134] A plasmid designated pWuΔdut was produced by removing a 540 bpAflIII-SacI fragment from the ORF₇₉₄ in pWu-H35 (see Example 1 c). Inorder to product pCELOΔdut, pWuΔdut was linearised with HindIII anddephosphorylated using alkaline shrimp phosphatase. After gelpurification the DNA was mixed with purified CELO DNA and used totransform E. coli BJ5183 (Degryse, 1996) to ampicillin resistance. Fromthe ampicillin resistant bacterial colonies obtained the DNA wasextracted and E. coli DH5a was transformed therewith. DNA extracted fromthese bacteria was analysed by restriction mapping in order to identifyrecombinant virus plasmids. The identity of the clones was determined byrestriction mapping (FIG. 7; pWu-H35 is designated “pWu” in the Fig. ).The digestion of the Wild type plasmid pWu-H35 with HindIII and SpeIyields fragments of 2944 bp, 1607 bp and 961 bp (track 2). The deletionwhich changes the dUTPase converts the 1607 bp fragment into a 1071 bpfragment (track 3; the modified fragments are marked with an asterisk).The plasmids which contain the complete sequence coding for CELO or thecomplete CELO sequence plus the dUTPase mutation were analysed bySpeI/HindIII digestion and showed the same change of the 1607 bpfragment into a 961 bp fragment (tracks 4 and 5).

[0135] b) Preparation of Recombinant CELO Clones from Chicken Cells

[0136] Either 6 μg pCELO7 (see Example 1 d) or 6 μg of pCELOΔdut (seeabove, digested with SpeI) were used to transfect primary embryonicchicken kidney cells (approximately 500,000 cells in a 2.5 cm well)using polyethylene amine (PEI)/adenovirus complex. For this, theQiagen-purified DNA was extracted with Triton X-114 in order to removelipopolysaccharide as described by Cotten et al., 1994. Transfectioncomplexes were prepared by diluting 6 μg of digested DNA in 250 μl of 20mM HEPES, pH 7.4. 20 μl of 10 mM PEI (molecular weight 2,000, pH 7) werediluted in 250 μl of 20 mM HEPES, pH 7.4. The PEI solution was addeddropwise to the DNA solution, incubated for 20 minutes at ambienttemperature and then mixed with 1.5 μl of an adenovirus preparation(psoralen/UV-inactivated adenovirus type 5, cf. WO 1719, 1.5×10¹²particles/ml). After another 20 minutes the complex was added to thecells in DMEM without serum (250 μl of complex to 1.25 ml of medium).The medium was changed for normal growth medium (with serum) and 4 to 5days later the cells were harvested, taken up again in 100 μl of HBS andsonicated for 2 minutes. A 10 μl aliquot of this sonicate (virus inpassage 1) was used to infect the same number of primary embryonicchicken kidney cells in a 2.5 cm well in a cell culture plate. Afteranother 4 to 5 days the cells were counted in order to determine thecytopathic effect (CPE end point assay or plaque assay, Precious andRussel, 1985). The cells were harvested (virus in passage 2) as in thefirst step and used to infect fresh chicken cells; harvesting thesecells yielded viruses of the third passage, the analysis of which isshown in FIG. 8.

[0137] c) Western Blots

[0138] The virus infected cells were harvested, taken up in HBS andsonicated. Aliquots were mixed with 5x charging buffer (250 mM Tris-Cl,pH 6.8, 500 mM DTT, 10% SDS, 0.5% bromophenol blue, 50% glycerol),heated to 95° C. for 3 minutes and then allowed to run in a 10%polyacrylamide gel. The proteins were transferred onto nitrocellulose,blocked overnight in 5% skimmed milk/TBST (10 mM Tris-Cl, pH 7.4, 150 mMNaCl, 0.05% Tween 20). Viral proteins were shown up usinganti-CELO-antisera from rabbits (1:1000) and anti-rabbit-horseradishperoxidase (DAKO; 1:20000) and made visible by ECL (Amersham). CELOvirus (2.5×10¹² virus particles/ml) was used as the control. The rabbitserum was prepared using CsCl-purified CELO virus and heat-inactivatedat 60° C. for 30 minutes.

[0139] d) Extraction of Virus DNA

[0140] Virus infected cells were harvested and taken up in 100 μl ofHBS/0.1% SDS/1 g/ml proteinase K, incubated for 1 hour at 56° C. andextracted with phenol/chloroform. The DNA was precipitated with ethanol.

[0141] The analyses carried out showed that recombination between thelinearised plasmid pWuΔdut and the CELO DNA yielded two types ofplasmids. Recombination to the left hand end of the dUTPase mutationyielded a Wild type CELO genome; recombination on the right hand side ofthe dUTPase mutation yielded a CELO genome which carried the dUTPasemutation.

[0142] The infection of primary embryonic chicken cells both with CELODNA and with CELO-Δdut-DNA produced cytopathic effects which took theform of swollen, detached cells after 36 hours, whereas control cells(treated with lysates of cells which had been transfected with an emptyvector (Bluescript pBS, Stratagene)) remained healthy in theirmorphology.

[0143] Western blot analysis showed that CELO viruses and CELO Δdutviruses produced by plasmid DNA are indistinguishable from Wild typevirus grown in fertilised, 9 day old hens eggs.

[0144] In all, the tests carried out in this example show that pCEL07codes a viable CELO virus genome. After excision with SpeI andtransfection into primary embryonic chicken kidney cells this DNA yieldsinfectious, passagable virus. Lysates of viruses of the 1st and 2ndpassage produce a cytopathic effect on primary embryonic chicken kidneycells. It should be noted that these lysates were produced bysonication, which rules out the possibility that the CPE in thesecondary and tertiary infections can be put down to the expression ofviral genes which originates from the residual plasmid DNA in thelysates; plasmid DNA cannot be expected to withstand the method used toproduce the lysates. In addition, it was found that with each round ofinfection there is a 100 fold amplification of the agent which causesCPE; this conforms to the amplification of a virus but not to the singlepassage of residual plasmid DNA from the first transfection.

[0145] The deletion of 540 bp in the CELO genome with which sequencesbetween an AflIII site at bp 609 and a SacI site at bp 1145 were removedand the open reading frame coding for dUTPase was destroyed, yielded avirus genome which is viable even in primary embryonic chicken cells.With the UTPase gene a virus gene was thus identified which is notnecessary for growth in cell culture. TABLE 1 CELO virus Sequences,published or from data banks differences between data bank publishedsequences and accession coordinates in new sequence (GenBank # authors,publication size the sequence Accession #U46933) remarks Aleström, etal, 1982b  101 bp  1-101 7 bases different 5′ITR K00939 Shinagawa, etal, 1983  68 bp  1-68 7 bases different 5′ITR Z17216, Akopian, et al.,1992 3576 bp   1-3576 3 bases different 92-100% labelled S61107 1missing base slight differences in 3 additional bases these 2 versionsZ48167 unpublished 3433 bp 13597-17033 6 bases different contains genesfor 4 missing bases penton base and core 1 additional base proteinsL13161 Cai, et al., 1993  900 bp 21023-21922 no differences containsprotease gene X84724 Hess, et al., 1995 7359 b  27060-34299 2 basesdifferent contains genes for 6 missing bases pVIII, fibre 1, fibre 2 3additional bases 11 (GCA) repeats (new sequence shows 9) 6 doubtfulbases M12738 Larsson, et al., 1986  440 bp 39584-40023 no differencescontains VA gene Z22864 unpublished 3670 bp 35235-38905 2 basesdifferent assigned by the 4 missing bases authors: 11.2-19.2% 2additional bases X17217 Akopian, et al., 1990 4898 bp 38906-43804 2bases different assigned by the authors: 11.2% K00940 Shinagawa, et al,1983  68 bp 4374-43804 7 bases different 3′ITR Aleström, et al.,  124 bp43680-43804 2 bases different 3′ITR 1982b 1 missing base

[0146] TABLE 2A Organisation of the CELO Virus Genome Cap, cleav- age,Amino polyA MW acid Protein ATG STOP sites Dalton groups Remarks L1 52K12193 13329 42094 378 IIla 13316 15043 63771 575 Protease cleavage siteat amino acid 551 L2 15080 Penton base SA Penton 15110 16657 56719 515noRGD base 16196 poly A-site pVII 16679 16897 8562 72 Protease cleavagesites at amino acids 27, 40 mu 16929 17495 19787 188 Protease (pX,cleavage sites 11K) at amino acids 125, 144 17526 poly A-site L3 pVI17559 18230 23890 223 Protease cleavage sites at amino acids 28, 212Hexon 18289 21117 106704 942 18261 Hexon SA Pro- 21134 21754 23763 206tease 21102 Protease SA or 21123 21767, L3 poly A-site 21836 L4 23608100K SA or 23649 100K 23680 26634 109905 984 pVIII 27149 27886 26876 245Protease cleavage sites at amino acids 40, 115, 130, 141, 166 27920 L4poly A-site L5 28315 fibre SA or 28341 fibre 1 28114 30495 81526 79330509 [GCA]9 repeat 30511 fibre SA fibre 2 30536 31768 42939 410 31771L5 poly A-site VA 39841 RNA to 39751 E2 and IVa2 IVa2 6685 5366 50366439 E2b 10268 6501 144984 1255 pol E2b 11996 10269 66089 575 ProteasepTP cleavage sites at amino acids 116, 141, 260, 264 DBP 23224 2189949272 441 23292 DBP cap site 21824 DBP poly A or site 21882

[0147] TABLE 2B Non-assigned open readin frame, larger than 99 aminoacid groups Right ORF's Left ORF's ATG STOP groups ATG STOP groups  794 1330 178  5094  4462 210  1999  2829 276  4568  3549 339  3781  4095104  3374  2892 160  5963  6373 136  1514  1191 107 33030 33476 14839705 39286 139 33169 33483 104 39256 38717 179 35629 36024 131 3614435536 202 37391 38239 282 35599 34238 453 40037 41002 321 33707 32892271 41002 41853 283 33058 32735 107 41958 42365 135 32429 31812 251

[0148] TABLE 3 Recombinant Adenovirus vaccines Pathogen ReferenceComments Respiratory Hsu et al., (1994) Glycoprotein F and G, Syncytialinserted into the Virus E3 region of Ad 4, 5 or 7 Hepatitis BChengalvalva et al., (1994) HBsAg, inserted into the E3 region of Ad 4oder 7 Pseudorabies Eloit et al., (1990) Pseudorabies Glyco- proteingp50, inserted into the E1 region of Ad 5 Herpes Simplex Zheng et al.,(1993) Tandem repeats of the epitope of gD Herpes Simplex Gallichan etal., (1993). Glycoprotein B Rotavirus Both et al., (1993) Rotavirusantigen, inserted into the E3 region HIV Natuk et al., (1993) Ad 4, 5 or7, HIV env, or gag-protease gene SIV Cheng et al., (1992) SIV Env Rev,inserted into the E3 region of Ad 5 Rabies Kalicharran et al., (1992) Ad5 with rabies glycoprotein Rabies Charlton et al., (1992) Ad 5 withrabies glycoprotein Human Marshall et al., (1990) gB in E3 region of Ad5 Cytomegaloviras Measles virus Fooks et al., (1995) N protein in AdE1Region

[0149] TABLE 4 Characteristics of the CELO Virus Genome. CoordinatesCharacteristics (Nucleotides) Category Left terminal  0-68 1 RepeatPackaging signal  70-200 1 unknown open   0-5365 3 reading frame L1 52K12193-13329 2 IIIa 13316-15043 2 L2 penton base SA 15080 2 enton base15110-16657 2 Poly A site 16196 2 pVII 16679-16897 2 mu 16929-17495 2(pX, 11K) Poly A site 17526 2 L3 pYl 17559-18230 2 Hexon SA 18261 2Hexon 18289-21117 2 Protease SA 21102 or 21123 2 Protease 21134-21754 2L3 Poly A site 21767, 21836 2 L4 100K SA 23608 or 23649 2 100K23680-26634 2 pVIII 27149-27886 2 L4 Poly A site 27920 2 L5 fibre SA28315 oder 28341 2 or 3 fibre 1 28114-30495 2 or 3 Major Late 7350-76502 or 3 promoter (TATA box at 7488) Tripartite Leader  8651-8700, 8798-8857, 9682-9774 fibre SA 30511 2 fibre 2 30536-31768 2 L5 PolyAsite 31771 2 VA RNA 39841 to 39751 3 E2 and IVa2 IVa2 6685-5366 3 E2bpol 10268-6501 3 E2b pTP 11996-10269 3 DBP 23224-21899 3 DBP cap site23292 3 DBP PolyA site 21824 or 21882 3 right end open 31771 to 43804 3reading frame (right end) right terminal 43734-43804 1 repeat (roughlythe last 70 bp)

BIBLIOGRAPHY

[0150] Akopian, T. A., et al., 1990, Nucleic Acids Res. 18: 2825.(Corrigendum: Nucleic Acids Res 19: 424).

[0151] Akopian, T. A., et al., 1992, Mol. Gen. Microbiol. Virol. 11:19-23.

[0152] Aleström, P., et al., 1982a, J. Virology 42: 306-310.

[0153] Aleström, et al., 1982b, Gene 18: 193-197.

[0154] Anderson, J., et al., 1969a, J. Natl. Cancer Inst. 42: 1-7.

[0155] Anderson, J., et al., 1969b, J. Natl. Cancer Inst. 43: 575-580.

[0156] Anderson, C. W., et al., 1989, Virology 172: 506-512.

[0157] Bailey, A. and Mautner, V., 1994. Virology 205: 438-452.

[0158] Bennett, D. D. and Wrigth, S. E., 1987, Virus Res. 8: 73-7.

[0159] Bett, A. J., Prevec, L. and Graham, F. L., 1983, J. Virol. 67:5911-5921.

[0160] Both et al., 1993, Virology 193: 940-950.

[0161] Boulanger, P., et al., 1979, J. Gen. Virol., 44: 783-800.

[0162] Bridge, E. and Ketner, G., 1989, J. Virol. 63: 631-638.

[0163] Brown, P. H., et al., 1993, Oncogene 9: 791-799.

[0164] Cai, F. and Weber, J., 1993, Virology 196: 358-362.

[0165] Calnek, B. W. and Cowen, B. S., 1975, Avian Dis. 19: 91-103.

[0166] Capado-Kimball and Barbour, 1997, J. Bacteriol. 106: 204-212

[0167] Caravokyri, C. and Leppard, K. N., 1995, J. Virol. 69: 6627-6633.

[0168] Cavanagh, D., et al., 1988, Virus Res. 11: 141-50.

[0169] Charlton et al., 1992, Arch. Virol. 123: 169-179.

[0170] Chartier, C., et al., 1996, J. Virol. 70: 4805-4810

[0171] Cheng et al., 1992, J. Virol. 66: 6721-6727.

[0172] Chengalvalva, et al., 1994, J. Gen. Virol. 75: 125-131.

[0173] Chiocca, S., et al., 1996, J. Virol. 70: 2939-2949

[0174] Chroboczek, J., Bieber, F. and Jacrot, B., 1992, Virology 186:280-285.

[0175] Colby, W. W. and Shenk, T., 1981, J. Virol. 39: 977-980.

[0176] Cosset, F. L., et al., 1991, Virology 185: 862-6

[0177] Cotten, M., et al., 1993, J. Virol 67: 3777-3785.

[0178] Cotten, M., et al., 1994, Virology 205: 254-261

[0179] Cotten, M., et al., 1994, Gene Therapy 1: 239-246

[0180] Cowen, B., et al., 1978, Avian Diseases 22: 459-470.

[0181] Cunningham, C. H., 1975, Dev. Biol. Stand. 28: 546-62.

[0182] Davison, A. J., et al., 1993, J. Mol. Biol. 234: 1308-1316.

[0183] Degryse, E., 1996, Gene 170: 45-50

[0184] Denisova, T. S., Sitnikov, B. S. and Ghibadulin, R. A., 1979,Mol. Biol. (USSR) 13: 1021-1034.

[0185] Descombes, P., and Schibler, U., 1991, Cell 67: 569-579

[0186] Deshmukh, D. R., et al., 1974, Am. J. Vet. Res. 35: 1463-4.

[0187] Eloit et al., 1990, J. Gen. Virol. 71: 2425-2431.

[0188] Estes, M. K. and Graham, D. Y., 1985, Adv. Exp. Med. Biol. 185:201-14.

[0189] Everitt, E., et al., 1973, Virology 52: 130-147.

[0190] Fooks et al., 1995, Virology 210: 456-465.

[0191] Furcinitti, P. S., van Oostrum, J. and Burnett, R. M., 1989, EMBOJ. 8: 3563-3570.

[0192] Fynan, E. F., et al., 1993, DNA Cell Biol. 12: 785-9.

[0193] Gallichan et al., 1993, J. Infect. Dis. 168:622-629.

[0194] Gelderblom, H. and Maichle-Lauppe, I., 1982, Arch. Virol. 72:289-298.

[0195] Ghosh-Choudhury, G., Haj-Ahmad, Y. and Graham, F. L., 1987, EMBOJ. 6: 1733-1739.

[0196] Gillen, J. R., et al., 1974, Mechanisms in Genetic Recombination(R. F. Grell, ed.) Plenum, N.Y., pp. 123-135

[0197] Gooding, L. R., 1992, Cell 71: 5-7.

[0198] Gouvea, V. and Schnitzer, T. J., 1982, Infect. Immun. 38: 731-8.

[0199] Gouvea, V., et al., 1983, Virology 126: 240-7.

[0200] Gräble, M. and Hearing, P., 1990, J. Virol. 64: 2047-2056.

[0201] Gräble, M. and Hearing, P., 1992, J. Virol. 66: 723-731.

[0202] Graham, F. L., 1990, Trends in Biotechnology 8: 85-87.

[0203] Graham, F. L., et al., 1975, Cold Spring Harbor Symp. Quant.Biol. 39: 637-650.

[0204] Graham, F. L., et al., 1977, J. Gen. Virol. 36: 59-72.

[0205] Green, M., et al., 1967, Proc. Natl. Acad. Sci. USA 57:1302-1309.

[0206] Green, N. M., et al., 1983, EMBO J. 2: 1357-1365.

[0207] Guilhot, C., et al., 1993, Oncogene 8: 619-624.

[0208] Haffer, K., 1984, Avian Dis. 28: 669-76.

[0209] Hanahan, D., 1983, J. Mol. Biol. 166: 557-580

[0210] Hertmann, I., et al., 1979, Avian Dis. 23: 863-9.

[0211] Hertmann, I., et al., 1980, Prog. Clin. Biol. Res. 47: 125-32.

[0212] Hess, M., et al., 1995, J. Mol. Biol. 252: 379-385.

[0213] Hosokawa, K. and Sung, M. T., 1976, J. Virol. 17: 924-934.

[0214] Hsu, et al., 1994, Vaccine 12: 607-612.

[0215] Huang, D. D., et al., 1987, Avian Dis. 31:446-54.

[0216] Ignjatovic, J. and McWaters, P. G., 1991, J. Gen. Virol. 72:2915-22.

[0217] Javier, R., Raska, K. J. and Shenk, T., 1992, Science 257:1267-1271.

[0218] Javier, R. T., 1994, J. Virol. 68: 3917-3924.

[0219] Jia, W., et al., 1995, Arch. Virol. 140: 259-271.

[0220] Johnson, D. C., et al., 1988, Virology 164: 1-14.

[0221] Jones, N., and Shenk, T., 1978, Cell 13: 181-188.

[0222] Jones, R. F., Asch, B. B. and Yohn, D. S., 1970, Cancer Res. 30:1580-1585.

[0223] Kalicharran et al., 1992, Can J. Vet Res. 56: 28-33.

[0224] Kawamura, H., Sato, T., Tsubahara, H. and Isogai, S., 1963, Jap.Nat. Inst. Anim. Health Quart. 3: 1-10.

[0225] Keeler, C. L., et al., 1991, Avian Dis. 35: 920-9.

[0226] Kidd, A. H., et al., 1993 Virology 192: 73-84.

[0227] Kodihalli, S., et al., 1994, Vaccine 12: 1467-1472.

[0228] Kozarsky, K. F. and Wilson, J. M., 1993, Current Opinion inGenetics and Development 3: 499-503.

[0229] Kusters, J. G., et al., 1990, Vaccine 8: 605-8.

[0230] Larsson, S., Bellett, A. J. and Akusjarvi, G., 1986, J. Virol.58: 600-609.

[0231] Layer, W. G., Younghusband, H. B. and Wrigley, N. G., 1971, J.A., et al., 1989, Mol. Immunol. 26: 7-15.

[0232] Li, P., Bellett, A. J. D., and Parish, C. R., 1983, J. Gen.Virol. 64: 1375-1379.

[0233] Li, P., Bellett, A. J. D. and Parish, C. R., 1984a, J. Gen.Virol. 65:1803-1815.

[0234] Li, P., Bellett, A. J. D. and Parish, C. R., 1984b, J. Virol. 52:638-649.

[0235] Li, P., Bellett, A. J. D. and Parish, C. R., 1984c, J. Virol. 65:1817-1825.

[0236] Maizel, J. V., et al., 1968, Virology 36: 126-136.

[0237] Malkinson, M., et al., 1992, Arch. Virol. 127: 169-84.

[0238] Mancini, L. O., et al., 1970a, Arch. fur gesamte Virusforschung30: 257-260.

[0239] Mancini, L. O., et all, 1970b, Arch. fur gesamte Virusforschung30: 261-262.

[0240] Mangel, W., et al., 1993, Nature 361: 274-275.

[0241] Maniatis, T., et al., 1989, Molecular cloning, A laboratoryManual, Second Edition. Cold Spring Harbor University Press, Cold SpringHarbor, N.Y.

[0242] Marshall, et al., 1990, J. Infect. Dis. 162: 1177-1181.

[0243] Mautner, V., 1989, Adenoviridae. in: Porterfield ed. Andrewes'Viruses of Vertebrates. pp. 249-282. London: Baillierre Tindall.

[0244] McFerran, J. B. and Adair, B. M., 1977, Avian Pathology 6:189-217.

[0245] McCracken, R. M. and Adair, B. M., 1993, Avian adenoviruses. in“Viral Infections of Vertebrates” (volume 3. Viral Infections of Birds)Eds. J. B. McFerran and M. S. McNulty, Elsevier Scientific Publishers.Amsterdam.

[0246] Mittal, S. K., et al., 1995, J. Gen. Virol. 76: 93-102.

[0247] Moran, E., 1993, the FASEB J. 7: 880-885.

[0248] Moran, E., 1994, Seminars in Virology 5: 327-340.

[0249] Morrison, T, et al., 1990, Microb. Pathog. 9: 387-96

[0250] Moscovici, C., et al., 1977, Cell 11: 95-103.

[0251] Natuk, et al., 1993, AIDS Res. Hum Retroviruses. 9: 395-404

[0252] Ni, Y., and Kemp, M. C., 1992, J. Gen. Virol. 73: 3107-13.

[0253] Nicholas, R. A., et al., 1987, Arch. Virol. 96: 283-7.

[0254] Oliner, J. D., et al., 1993, Nucleic Acids Research 21: 5192-5197

[0255] Pieniazek, N. J., et al., 1990, Nucl. Acids Res. 18: 1901.

[0256] Plank, C., et al., 1992, Bioconjugate Chemistry 3: 533-539

[0257] Precious, B. and Russell, W. C., 1985, Virology, ed. Mahy, B. W.J., IRL Press, Oxford, Washington, D.C., 193-205.

[0258] Rao, L., et al., 1992, Proc. Natl. Acad. Sci. USA 89: 7742-7746.

[0259] Raviprakash, K. S., et al., 1989, J. Virol. 63: 5455-5458.

[0260] Roberts, R. J., et al., 1986, A consensus sequence for theadenovirus-2 genome. (in) Doerfler, W. (Ed.); Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston.

[0261] Ruley, H. E., 1983, Nature 304: 602-606.

[0262] Sarma, P. S., Huebner, R. J. and Lane, W. T., 1965, Science 149:1108.

[0263] Schnitzlein, W. M., et al., 1988, Virus Res. 10: 65-75.

[0264] Scott, S. D., et al., 1989, J. Gen. Virol. 70: 3055-65.

[0265] Shafren, D. R. and Tannock, G. A., 1991, J. Gen. Virol. 72:2713-9.

[0266] Sheppard, M. and Trist, H., 1992, Virology 188: 881-886.

[0267] Shinagawa, M., et al., 1983, Virology 125: 491-495.

[0268] Stouten, P. F. W., et al., 1992, J. Mol. Biol. 226: 1073-1084.

[0269] Tagaya, Y., et al., 1989, Embo J. 8: 757-764.

[0270] Taylor, J., et al., 1995, Vaccine 13: 539-549.

[0271] Traenckner, E., B-M., et al., 1995, Embo J. 14: 2876-2883.

[0272] Trapnell, B. C. and Gorziglia, M., 1994, Current Opinion inBiotechnology 5: 617-625.

[0273] Treanor, J. J., et al., 1991, Vaccine 9: 459-501.

[0274] Tripathy, D. N. and Schnitzlein, W. M., 1991, Avian Dis. 35:186-91.

[0275] Van den Ende, M., Don, P. and Kipps, A., 1949, J. Gen. Microbiol.3: 174-183.

[0276] Van der Eb, A. J., et al., 1980, Cold Spring Harbor Symp. Quant.Biol. 44: 383-399.

[0277] Vrati, S., et al., 1995, Virology 209: 400-408.

[0278] Wagner, E., et al., 1992, Proc. Natl. Acad. Sci. USA 89:6099-6103.

[0279] Weber, J. M., 1995, Current Topics in Microbiology and Immunology199/I. eds. W. Doerfler and P. Böhm. Springer-Verlag, Heidelberg-Berlin.

[0280] Weber, J. and Anderson, C. W., 1988, J. Virol. 62: 1741-1745.

[0281] Webster, A., Hay, R. and Kemp, G., 1993, Cell 72: 97-104.

[0282] Webster, A., Leith, I. R. and Hay, R. T., 1994, J. Virol. 68:7292-7300.

[0283] Weinberg, D. H. and Ketner, G., 1983, Proc. Natl. Acad. Sci. USA80: 5383-5386.

[0284] White, E., 1994, Seminars in Virology 5: 341-348.

[0285] Wold, W. S. M. and Gooding, L. R., 1991, Virology 184, 1-8.

[0286] Yates, V. J., and Fry, D. E., 1957, Amer. J. Vet. Res. 18:657-660.

[0287] Zheng, et al., 1993, Vaccine 11: 1191-1198.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 54 <210> SEQ ID NO 1<211> LENGTH: 43804 <212> TYPE: DNA <213> ORGANISM: CELO Virus <220>FEATURE: <221> NAME/KEY: gene <222> LOCATION: (12193)..(15043) <223>OTHER INFORMATION: /gene: L1 <221> NAME/KEY: misc_feature <222>LOCATION: (15080) <223> OTHER INFORMATION: /note= L2 region penton basesplice acceptor site <221> NAME/KEY: gene <222> LOCATION:(15110)..(17495) <223> OTHER INFORMATION: /gene: L2 <221> NAME/KEY:polyA_site <222> LOCATION: (17526) <221> NAME/KEY: gene <222> LOCATION:(17559)..(21754) <223> OTHER INFORMATION: /gene: L3 <221> NAME/KEY:misc_feature <222> LOCATION: (18261) <223> OTHER INFORMATION: /gene: L3/note= hexon splice acceptor site <221> NAME/KEY: misc_feature <222>LOCATION: (21102) <223> OTHER INFORMATION: /gene: L3 /note= proteasesplice acceptor site <221> NAME/KEY: misc_feature <222> LOCATION:(21123) <223> OTHER INFORMATION: /gene: L3 /note= protease spliceacceptor site <221> NAME/KEY: polyA_site <222> LOCATION: (21767) <221>NAME/KEY: polyA_site <222> LOCATION: (21824) <221> NAME/KEY: polyA_site<222> LOCATION: (21836) <221> NAME/KEY: polyA_site <222> LOCATION:(21882) <221> NAME/KEY: misc_feature <222> LOCATION: (23608) <223> OTHERINFORMATION: /note= 100K splice acceptor site <221> NAME/KEY:misc_feature <222> LOCATION: (23649) <223> OTHER INFORMATION: /note=100K splice acceptor site <221> NAME/KEY: gene <222> LOCATION:(23680)..(27886) <223> OTHER INFORMATION: /gene: L4 <221> NAME/KEY:polyA_site <222> LOCATION: (27920) <221> NAME/KEY: misc_feature <222>LOCATION: (28315) <223> OTHER INFORMATION: /note= fibre splice acceptorsite <221> NAME/KEY: misc_feature <222> LOCATION: (28341) <223> OTHERINFORMATION: / note= fibre splice acceptor site <221> NAME/KEY: gene<222> LOCATION: (28363)..(31768) <223> OTHER INFORMATION: /gene: L5<221> NAME/KEY: misc_feature <222> LOCATION: (30511) <223> OTHERINFORMATION: /gene: L5 /note= fibre splice acceptor site <221> NAME/KEY:polyA_site <222> LOCATION: (31770) <400> SEQUENCE: 1 gatgatgtataataacctca aaaactaacg cagtcataac cggccataac cgcacggtgt 60 cactcgggtacaaattatga attcgatctt tggacttttc gacgcgccca gtgactgtac 120 tttattgcgccaattcacca cgcccgggag atttcgaaat tgctatttcc gtgcagttcc 180 gcattccgaagtacaattta accggtttta tgggtgttcg gtgtttttct agcttaatca 240 ttgtttttagacgacacagt gggtatctgt tttcgcttgg acttggctcc gctttgtgaa 300 aattcaactcgatccaacat tttccttatt gatggaaggc ttttattatt tgcacaacag 360 acatcgcgctatttacacag aacgcaaagt gctgtctttt ttattccttg ttccgggtac 420 atcttttattgctagtgcct cgcctatttt tagtcacgta tcttccttgt tctatagcta 480 tatattcacgcggttttcgg tctctcctca ctcggcagat gacttcggaa gagaagctgc 540 agagttcgtctccggagacc ggcctcgccg ctgtcgtcct gcaaagcccc cttgaggtac 600 gtgtgcctgccgttcttcct cctccagtgc gaattgacat ctaccgctac ccaggctttc 660 cgccaacggagaccatctgg cacggtctca tcacgcagac tgagttaaac caggctttgg 720 agagcatcgttgagcaattg tagtaagtgt cagtccctat ttttctgttt tttcttgtat 780 ttcctcttagacgatggacc cgttcggttc ttcttcagtc cctccgtgct ctacatcaga 840 ccttcccgaacccaagctct atttcgtccg cttgtcaccc catgcagtgc ctccagttag 900 ggctacgcacggagctgcag gatacgattt gtttagcgct tacgacatta aagtgcctgc 960 tcgcggtcgggcgctagttc ccacagattt agtttttcaa tttccgcccg gctgttacgg 1020 tcggattgctcctcgctcgg gcttggccgc caaatttttc attgacgtcg gagcgggtgt 1080 tatcgatcccgattaccgcg ggaacgttag cgtggttttg ttcaatttct ccgagagctc 1140 gttcaacatcaggcgaggcg atagggtagc acagcttatt ctggagcgta ttatggttcc 1200 ggagctgtcggaactgacgc agctaggcga gacagaccgc ggggcgtcgg gtttcgggtc 1260 cacaggtatgggtgctgtag accgcaatca gcgctctgtg ttggaatggc tgacccctgg 1320 ttcccgttgataggaccctt gcgacgaagt cgacttcgct gtacgcttgc aagcagattt 1380 tgagggggacgacgacacat acggacattg tctgtgcgaa ggcagcacca gcaagaggcc 1440 ctgctgtacgctctgtaaca gttactgcgt gcttctttag gtatgattgt gagcgtgtgg 1500 cagctcgcgaccatcgcgat tgagttttgc gataagtggg tggggaaata ctacagattc 1560 cgaccgtatcatgagagact tttgcttatg cagagtcggc aagctttgga aaggagcttg 1620 cgccgctgcgtaagtgaagt tggacctccg ccagagcctc tagaatagcg atggagtccg 1680 gcgcacgtggtgcttcctga ctttctgcac cattaccgtg gtgtttctgg ccttctttct 1740 gcagaaacttctcaactaca tagatttcag agatagcgac tgcacagaat gtttttttgt 1800 gtagttgacaggactatgga ccagacagca acccatacag ctttgattct tctgatcgtc 1860 ttgacggtgttcacgggcgc ggtggtagct ttgatgttgt atattgcgat aactggactt 1920 ccttgctctatgctttgctc tcaataaaga ttttcagaat ggtgattcta tggtattttg 1980 tcttttttctagacagtcat gtcgcgtgag tctgaacgtt actggacttt ggtgcacgct 2040 ctgattgatcggggtgtagt cagccgtgaa cagtggcaaa tggttgaccc tgcgcaatac 2100 cagttctaccaccgctccaa acagaggggt tttaaggtcc gtcacattct tcgtgatgtg 2160 attcgccacatgtgttggtc tcggactctg ttagattata tgtcctcggc ctcgacacct 2220 agtccggacgatgtattacg caatcccctg tatcagttgc tgttatgtaa tggatataac 2280 cccgctgttgtggggacagc gctgatccgg tgggcgggcc atcagagcaa ccgtaacact 2340 gtttggattcgaggcacccc tatgtccgga gctccgtact tggcacaggc tatcgcgtac 2400 tgctctcccctcgtagggag cgttgattgg cgcaacaagt ctaacccatt cgaagggtgt 2460 ccagatagtctggtgttttg gtgggacggc ggttatgttt atgattgttg tgtgggtctg 2520 gtgaagcaggtgttccgggg agaacatgtt attttgcctc ctgagggctt gcgtggcccc 2580 aacccgtgctctgaactctt caggacccca gtcttgatgt acagccaggc ggatatttgt 2640 atgactaggctgagatcagg ggaactaagt gcagagcatg cggtgggcct cagggattgt 2700 atgtacctgatccgtttgac agaagatttt gactgcgcgg gtggtatatc gtgtgcagat 2760 gtcaaacagtttgtggcgtg gagccgcgaa caccctgggg aggttcgcga gacccacgaa 2820 ctcaaataaaaattcgggac ttctgtgtac gttccttttc atgtttatta aacactgttc 2880 tttcgagtgagtcatatcac gtggaagtta attgcgactg ggagccgcag aagcaggtgg 2940 taaaagcaagctatgcaggg atagtttacg atgtcccttg gaagacagac atagagtgtt 3000 tgtccttccagtcgcgtgta ggcgccttgg cgccgcgcaa cggttactgc tgtatatatt 3060 tgttcgagggcagggttcaa gttgcataac ctgcagtgag tagcatatgt gtataagatg 3120 agattaagagggagttgggg ctggtggtcg tgttgagcaa cgaaattgac gaccgttttg 3180 gcccagtgagctgtaagcac tcggatgagg gataaacaaa agagggaacg gggtgtctcg 3240 cagctgcagatgtactggat agtccagtcg gtacatcgca tctcaataac tcttatccag 3300 ctgcagaatgacctgagctg gaggtcacac tcttcgtccg taatgcagtg gggcgagtgc 3360 ttgaaggggtacatctgtct tttaaggaga aagagtagga aatcgggatc tgtgattagg 3420 gtaatgcccacgcgcgtgaa caggggctcg atgtagatat gccaactgtg ttggggctcg 3480 tcctcatcgttgtcactatc ccaaacaagg tcgcacgact cgaatgtctg taaaacatca 3540 aaagtttattactgattttg aagaggggta cgtatacagg ggttacagag tttattatcg 3600 cgtttgcaatgcagtgtcat taagcaagca caagcggcag cattagcaac aaagtcgcgt 3660 agctctttgcgcgggtggta gcatggaggg gtcttggtgt ggtctcctgt gaggaagaag 3720 acgaccgtgttccgcgccgt gaactcgttg gcgagcgaga ggtccgcgca gtggattagc 3780 atgggggtcgaggggatgtg gaacgttttc ttattcagcc tgcaggtggc agcattgcct 3840 tccataaagtgcagcatcaa cgggtccggc ttctcgtcta cgaagggcag acagtataga 3900 gaggcgtggggtgcgatttc gcccagtgat agtatggagt tgattcggct gtcggagatc 3960 gcgagcgggaaacacgcgca taaggcatta aagagactct tggcgttgga aagtcttcct 4020 ccgcagagcacgagggtgtt tagttctcct cggagccatc ggaggatggc gttagctgcc 4080 acttttccctcgtagccttc ggcttccatc atctgatgga accagttgtc gcagacgttg 4140 tagaacgcagccttgagggt gtcgggctct tcggagggta actggttaat gaggctctgt 4200 ccgggtccgaaccgttcctt gagggcttcg agtacacgct ctcgttcgta ggtgggtatg 4260 aggttcttggacgggaagtt ccactggact tcgtgtgcta caccttcgac aaacaaccgt 4320 tcaaccaactgatccatggg ggacgtggac ggaaataagc agggtgaagg ggtgtctgga 4380 atgcaggatgctatttgctc ctcttcgcta cttaaagacg actgagactc gcgtcgcctc 4440 ttggtgcactgtgggagaga attatatgtc atgtgatgag tgacaagcat tgcatgggga 4500 ggtccacgagcgggaacaga gcattccttt gttctcgcat aggagagcgc gcaaggcgtc 4560 gttgtcattatctgcctgta attggtgtat gcgatgaatg tagcgggaga gctctcccag 4620 aaagaacccgcagccgttgg cgggttcgta gaacctaata atgatatgtt cgtcgggagc 4680 agtgggcagatgttcgggat gggggagctg atataagcgg acgaggcatt ttttctcact 4740 gatgtccacgagaccgtcgc gggtcattgt ggtgaatgac ctgcctttca gcagctggtt 4800 agcaacgggatctttgaacg tgttacaatg agctgtgcag gggaaataaa gcatcttgac 4860 taaatcttgctgtctggcgt actggccaaa gtcgaacgcg ttaatgttgg cagtgaggac 4920 aaattggaataatctgagga tggatgcagt gaatatatct gcggcagagg aaggttctcc 4980 tacgacatagagcgtgttag tataatggca caggcgacta atgctcatgg cagacatcca 5040 cagataaatggcgtagatgg ctaccggagc attgtaacct tcggcttcta gcatggctcg 5100 gagatctctttgtgaggaca gctcgaagcc ttgcagattg aaattgatcg tttgactgag 5160 cccgtagctgcagtataatt tgcgcgcttc ttccagaagc gcgggggcga ccgattcgaa 5220 tagttcagcatcttggggat accgagtgag ccaatctttg taggtaaaga tgttgtttcg 5280 ctggagatcgaatattagcc tgtgtgtgac taaatcgtcg tcgtcttctc ccgtgtggcg 5340 gttcctgagtctcttaacgc tatccttaag tttcctgaaa cgttcctggg ggagaaagtt 5400 tttcgagatcggttagtttg gggtttctct ctaactgctt tctatgtgtt tcgctaaatt 5460 tgatgatagatttggctgtt ttcaggaatc tctcctgaat gttaacattg agagggatga 5520 tcgagtactgctgatctata ctgcaccacc taaaggactc ggatactggg tccggagtgt 5580 acatgacccagctgaaccgc tggttctgct gcaggaagga aaagtaagct ttgagaagga 5640 caacgaggtccttcgagatg ttgtgtgcga acgcatagag gaacctagcg aactggaatt 5700 ggggaatatgacaggatagg atgtgcattt tcgcgttcac tttgagggtg ggaacgtttc 5760 ctatcgcggtgcgcggtgcc atgttgtgca agactacgaa aatgtagaag gctgtacagt 5820 gggcagagcgagcaaagagc ttagaaggaa gggcgtgaaa gaggacagag acgctggagc 5880 ctgaacagagcttatccatg cactcgtcca tgataatggc gacgggtccc cgcttggaga 5940 ctttcacgtaaatgttgtct ggatggtcga tgttgagatg ttcgggtgcg gtggcctcct 6000 cgtaagtcatctccataaat tcgggacgga acgtgcttgt cttaggggcg atggtgccgt 6060 cttccctgcagtcgaaattg gcctcgacca gctgcaggtt ccaggacgtc tgttcgatgg 6120 gtggaatcatgttcttttcc ggagtgatga agatgaccgt ttcggggatc gggtccaaca 6180 tgttgcacgagatgagcgcc cgcagcagat gcgacttgcc ggatccggtg ggtccgtaga 6240 tgaccccgatgatgggctgc ttgcccatgt tgatggacgg cagctgtccg tgttggaggt 6300 agcgactgtccatctgttcc tgggctttaa cttcttcgtg ggcttcgaga aacttctcat 6360 taacgccccctaggctgtag aactcgtcgt aggaggggaa gtgttgctcg cggaagagct 6420 gcggtgctaggtcgacagct ccgtcgtacc agctggtgac gcgctggtag aagtcccgcg 6480 aggcttcgtactcttcttcc tcatactccc aggctttccg cttcctggga gctatcatct 6540 gcgaagagtaggtcgtgaac ttgcccgcat tcctcttcgg ataggaacgc gtagggttcc 6600 catcgtaggggtgcgagggg gtcttcgtgc ccgacgattg ggacgggtcg tacgtctcct 6660 gtcgtgcggggatttgggtg ctcattgtcg taggggtaca ggtagtcccc gtgctcgtat 6720 agggtgaggtccttccacgg acgcagcact cgcgtgagct gctcgttgtg aatggtgaaa 6780 gggtcgtagcgactgacctt gttcagaagc gtggttttaa agatggttct ccgcgtgtgt 6840 agctctgggatgttgctctg cgctccgaat tgcacgtcct cgtgtctacg ccaacagcgc 6900 agcagcgtgtcgtagatgag ttcggactgc ctgtgtccct tcgatctgat tttcccgggt 6960 cctaccgtgcggcactgttc gttgacgcag attgagtttt tcagtccgta cagttttggc 7020 gctaggaagatggtttccga gctgtacgtg tcacttccgc aggctttgca cttgatgtcg 7080 caatcgcaggcccagtagag gccgggattt tctggatcga aagtcagtcg agtggattct 7140 gttttgattcggtgcgcgcc gcggcttttc atgcgatgat agcctgtttc tgtgacgaac 7200 aggctgtcggtatcgccata gaggctgcgc ggctcctccc ttcgcaggat gtgcactcct 7260 ctgtccggtccgtacaggat gtcacaccac tcgctgaaga aggccctcga ccagcccagc 7320 acgaagcaggcgatttgcgt ggcgtatctt ttgtttgcca cctgcttgtc caggctttcc 7380 agatggagcacggttagggc ttctggtgtg gcttcgagga gacgcatagg tttaaaccgg 7440 gtctcgttagcgcgagcgta gtgggcggag cttagctccc cctcgggtat ataagggccc 7500 gtcgcgagggtcgcctcgac ttccaggtct acggcgacga gaggatcgtc ttcggtaagt 7560 gcgtttttcggcttacgggc actctcgcga tcgctgtcgt ctccttcttc ttccccatcc 7620 tcttctgcttcgggctcttc ctgttcgggg tcgtctgcgt agcggaactg ttgtcgtaga 7680 ctctcctcactgaaggggtt aggcgcgttt tcgagggtga cttccgttcc gttgaacgag 7740 tcgtcattgagcagcgtgac gtgtttgacg atttcagtgc cttcgtagat gtttttctta 7800 tctgcttccgagaggtcctg ttcaaagatg atgcgcgtgg tgtccatgtt ggtggcaaac 7860 gcaccgtacagcgcgttgct cagcattttg gagatggatc gaatcacttc gttcttctcg 7920 cgatcggctttttctttggc gaggatgttt ttcgtgacgt agtcggcaca tagcgttttc 7980 cattccggaaaaacaatgtt catctcgtca tggaggacct ggactcgcca tccccggttg 8040 tgcagcgtgaggatatctat gacggtgacc acctcgtcgt agagagcctc gttggtccag 8100 accagtctgcctcccctccg ggagcagatg ggagggagtg ggtctaacat ttcgggcggg 8160 ggagggtaggcttctatttt caggatggaa ggcttgatac gcgcatcgaa gtagctcaga 8220 tgcgattcgttggtcagcag ccggttgagc tcctccacgt gctgcgcggt aaattttgga 8280 tctaggggcattccgtgggg catggggtgg gtgagggcgg aagcgtacat gccgcagatg 8340 tcaaagacgtagacgggttt caggtaaggt ccgagcacgt tggggtagca tcgtccgccg 8400 cggagcgcttggcgtatgta tttgaacatg gggcggtggg gggcgtagac ttcggccacg 8460 tagtcgggggagggttgttt tttctttttg gttcgacctt tctttttggg gggttcgggg 8520 acggagggagggcatgtcgc acgctgttcg cggacgtaat tggaaaaggt aagttgcttc 8580 caaaaggcatgagtgttgct ggggatggtg ggccgcacga agatgttaaa atggccttcc 8640 atccctagttctcgttggaa ataggcgtcg tagctgtcgt gtaacgtgtg ggccagcttt 8700 tgggtgacgcggacgtcctg catgcagtat tcgaggcacg cttggacgat gtcgtactgc 8760 tggcccgggtgttctttctg ccatagattc ttctgttcag cgatgacgga tgggtcttcc 8820 cagtacctttcgacaggaaa gccgtcggcg tccgcgtgaa agcgccccgt ggaaatgaat 8880 tcgttgatggcctcgtatgg gcaatgtccc ttgcagaggt ctagcgcgta ggctgccgcg 8940 gctttggagagtttggcccc gctggtgagc tgtagagtgt ctcgcaccat gaaccgcaca 9000 aataccgagcgcgcatcctg atgggacacg atcccgcgag accagcgttc tacgcgggag 9060 gcgtctttcttcacgtagtt ggggtttggc atgcggaaaa tgatatcatt gaacagaagg 9120 cgaccgacgcgaggcatgaa ggatcgatca catttgcacg cttccgggaa taggtccctg 9180 cgctcgacgagttccgtggc taagaggagt tcatcgaact tacatatgtt gtgacctagc 9240 actacgatgtctacggaata aaagttatcc gggagggaga ggggggaggt gggtttctcg 9300 aagagctcgtacggtatttt gtgaacggag ccgtactttc cgtccttgac tagctggtcg 9360 caataatctcggttggcagc cgcgtagcgg tcgactagat tttgggcaaa ttctatttgc 9420 agacgggacctgaagttgcg aaaccttctg gcaacctcgc ccggatggct gtctagccaa 9480 tagaagccttcgtcgagggc tttgagacgg tcgtcctgcc gtgctaagcg ttcggcgcgg 9540 gagaccagctgggggtctcc gctgagcatg aagcacaaca taaacggatg catgcgcttg 9600 cctttcttttcgaacacggt atacgtctcg atatcgtagg tgatgtacaa ctgacgtatg 9660 tgggggtgttgggctggaca ggagaagtgg acgtgctgcc acaaatcgct gcccgatccc 9720 tggacggcgtggtagtagaa ggcagagcgg cgttcgttgc acgagtgtcg tctgacccag 9780 tgtcggccgcaggtgggaca ctgttgaacc ggagtgcggc tggtgatcca gacccattct 9840 ccggtgacgtttttggcgat gagcatgggg ggaagggcgg gatgttcgga gatgtgcacg 9900 gtccgcaggcgcgctgtttt gcctttaaag ttgatgacgg tgacctgggc gggcttaaag 9960 tcggggagggagcgttcggg gtctcgggca tagtgcagat agtctattcg gtcgtagcct 10020 ctggctctcttagacagcag gaagcggtgg gtgcgtagga atttcttgag acctatgggg 10080 aatacagtgggttggaagcg gaagggctgt tcctcgatgt aatagacgtt tgtgcgtagg 10140 gggttgccccggaagctctt gaagtacttt cgggtctctt ccagccgttt agcggaaatg 10200 gttcggatgacgtgcgaatc ggggcccagc acgctgtccg tgaggggagg ccccgcgtct 10260 gcttccatttacagaggctg acctcgtcgc gccgccgctt cgatgtcttc gcggcggtgg 10320 cggagcacgcgtctgaggtt ggctttgatc tcctcgttgg tagcgatggc tacgatgcct 10380 ttaaatttgatgcggaaact gatgtcgatg ctatcgatca gctcttcgct gaggttgagc 10440 tgcttcagcacttcttcgat gtcaccggag cggtctctgt attgaatatc agagagaaac 10500 agttgttggtccgcatcgtc catgccttcg aattgacccg tccgttcgac catcataaga 10560 aaatcgcgtaatatacgttc ccacagggtt tcgaatatgg tggcggggtt ggattgctcg 10620 ctccatatgcgtttaaaaac ctgctgcgcg ttgacgtccc atcccacgac gagtacttgt 10680 aaggtcaggacgtcgacgta ccggcggaac tcgcggttgg cgatgaagtg gctgtacagg 10740 tagtagagtgtagaggcgat atgttcggct aagaagaagt acagcaccca cttgcgcagg 10800 aacgactctcccatgagtcc cgcgtcgcgc gcggtcagga gcatgcggta gaagtcgttg 10860 gcaaacctgaagagttcgtg tctccgggcg gccccgctga gctcgtcttg cagtgccccg 10920 atggcttcgagcgctgtgcg aatcacctcg tctaacagct cttcttcctc ctcttctggt 10980 tcttctacttccatggcggc ttcttctatt tcttcgggag ggggcgcggg ggaggggggt 11040 ctccgtcgccgccgcgtgac gcggggcagg cggtctacga aggcccgcac ggcacggggc 11100 ctaatgcggcgcgcttcgga cgcggtgatg gctctgccgt ggcggtctct agggcgcagg 11160 ccggtgcggtcggttacccg ccggctgcag agggtgatgg cgcctcctga cagacccgtt 11220 cctgctaaggaaacggcgtc gctcgctaga gtgctcatga aacatctcgc cattgtttcc 11280 gtaggttcctctacgtcccg tcttcctcga aacttgcgct cggcatcccg caccagttgc 11340 tgggtgtctagctccgcgaa ggcttccgcg aagtaggaga gccagtttgg ttcaaggaac 11400 acatcggatccaggtaagaa ccgatatctt tcttccgttt gaaatagatc atatgcgtag 11460 cagaatagatagtggcagag ggcgactctt agttggcgga tggcggcgag cagttcggcg 11520 tcggcggcagaggccgttcg catgaggacg ccttcttgca agccacccgc tccgttgccc 11580 gacagtaggctgtggtgttc ggcgtcgagc tgcggcaaca cttgtccgtg acggcctacg 11640 tcgattcccctacctcgcag atgcgcccgg cccatgtccg cggccacgcg gtccatgagt 11700 acggcgttgtgcatctgcgt gaacgtaccg tgaaagttgt cgagatcgag aaatcgcatg 11760 tactgccctacattgacggc gtaggagcag tcggtcagac aggtccaaaa gaggcgtctg 11820 ggtctttggggtgggctatc gtaacctatc tgcataaaaa cgcggttgtc gaaaaggtaa 11880 tcgttgagcgcgcggtgcat agcttggtac ccgaggagaa ggtgcggcgg cggcaatccg 11940 ttgtagggcggcgcggcgac gttcggcgat cgcggcgcga ggtctcggag ttgcattaaa 12000 cggtagtcgtacacccggct gacgagaaac acgcttcggg ggtggaccag ggctggttct 12060 tcccgaacgaggacatagtc agcggtgatg acgggttcgc agaagcggac ggtggctaga 12120 ctctgtccggtgagatccgc gaagactctg taagcctgaa attgagcccc tgacgttttt 12180 agaccgctcgtaatgcaccc cgtcctgcaa agcgttcgaa acgcgagcgt gagcgccgga 12240 ggaccccatcaacagcaacc gcagcagcaa cagcacggtg tgtcgtcggt ccgtcgtcct 12300 ccttcaccaccccgatatcc cgcacaacat gcctatcccg gcgcgggcgc gacacccacg 12360 gcaggacgaggcgatttcga cggcgcgctt gatcccgatg aaggaccggt cgcgtgcggg 12420 ctggcggccggggccggtgt ggacgaagtt agaatgaggg agcgggacgc cgcgcggcga 12480 gccacggtgcccgagatcaa tctttttaag gctcgacgtg acgtggtgcc caatggggat 12540 tacgagagggatctgatgta ccactcggga caggcaatcg atatcgatcg gcaacgtgtg 12600 ctcactccggaagactttaa ggggtccgag ccggctttca cgccggctgt caaccatatg 12660 cgcgcggccgagttgaagag ggcggctgag cagacggcat ttggggagga attgaggaat 12720 acctgccatcagacccgcat ccgcacggct ctgttaaggc ccgagatcgg agcgggaatc 12780 tactatctgtacgatttcgt ccagacttat ctggagcatc cggacggtcg ggtgaagctc 12840 aatcctcagctggtgttggt ggctcagcac gcgggcaata ctatgctggc gcagcgcttg 12900 tgggccatcgcagaggagaa gaatgcgtgg ttgagagatt tgatagagat ggcgtacatg 12960 atcgtgaacgatccgtacct caatacggag cagcagctgt cggccatctg cacgacggtg 13020 gtcgagttgagcatgaaata cgccaagttg gccgccaaga acggttaccc gtccatggcg 13080 cagatggctaaggcgcagga atttttctac cgggtcatgc aagcggtgct cgatttaggt 13140 gtccaagtgggggtgtataa caaccgacca gctcggtacc gtcagaagcg catgagcgag 13200 attccgcagatgactgacgc cgagtacatg ttcggtttga cccaggcgct ggagagcagg 13260 cctccgcagggcgaatcttt tgccgacgag gggccgtcag aatcggacga cgaggatgac 13320 ttcatctgatacgtttctgg ctcttgcgcc ctacgggcgt caggaggtgg cggacgccct 13380 cagttcgctcccagatggca aggacgcgcg gtcgctacgt catgcaccct acgcaaatcg 13440 cctcatcaaactccagagcg ccatggtgcc tccaaaagtg gacggtactt ccgagcgggt 13500 ggccgaaatcgtgaaagggc tagccgagca aggcgccatc taccccgatc agatgggcgc 13560 gatccactcagatttgctta atcgagctta cacgtggaat tccatggggg tgcaggagag 13620 catccaggcgctggtcaacg acgtgatcca cggacagaac cggacattgc aagacgagct 13680 tgcgcggacgaaagaaatag cgaatgcttc gctcttgacc caatttttcg attccctgta 13740 caaaacggtggatcgtgggc agcgaaattt tgagggcttt aagaaacttt tgcgtctttt 13800 cgtgaataacgtgccgaatg ccgaagtgta cgggtcttcg gggtccttta gcgtgcagat 13860 aaatcttggcggatctagtc aaaacatcaa tctgaccaat gcgtttgaga atttgaagcc 13920 gatatggggcgcacggtggg acgcggtgaa taatcctcgc atcggggcgc ttctgacacc 13980 caacactcgagcgttgttgt ttttcgtgag ctctttttac gactacgggg ctatggagcc 14040 cggtagttacttggacaata tcatgaggct gtacaaggag gctatcagag ccgatgtgga 14100 cgcggagggtgatgccatta tggagctcgg ggaggcgggc gcaaatctca acttgcggtt 14160 caacgattacaaggacacac taaactacct cctgcaaaat cgagaggttg tacccgacac 14220 ggctccgctggagctgagcg cggagcagga aatgctcttg aagtacctga tgaggcaact 14280 acgacaggctcttaaggacg gggtcccggc ggacatttct atcagtacca tgactcagta 14340 cctagatcctaggctgtatc agacgaacaa ggtgttcgtg gagaaattgc aaaactacct 14400 gttggcggctcaggcgcgca atcctgtgta ttaccgactg ttggtgctgg accccaactg 14460 gcggcctccggcaggcctat atacgggtaa ttacgtgata cccgaccgct acgactttga 14520 ggacgtgcagagcgagcttg aatacgcggg tccctccaga gacgagtatt tcgatgattc 14580 tttgttcgcaccaggtcctc agcgccgctt aaattcggcc gaggaggctc aattggagcg 14640 tgacatcgaatctttgaccg gccacattga cgaagagctg ggcgtccaat ctcaggctgg 14700 ctggctcgccgatcaccgcc tgcctgtcgc gttcgatggc gctctcagcc ttaccgaacg 14760 caacgcctacaacacgccgt tgccccccga ttcccacatg cgtagccgtt ctagctccgt 14820 cgctagcgatcttgggctat tgaacctatc tgggacgggg ggaccgggct ttttcgctag 14880 tctgcggccttccatcggca gccgtcaacc gaccggcacg gccgtgggcc tccgcccgac 14940 gacaccgtacagcggttcgg ggtgtatgag gggcaccggt ctggcgcgca aagttttaaa 15000 cccggccgcgtcgcgccggg ggcgcaagct acggttctac tgaaccctag actctgacga 15060 agaaacttaaaaacgcttac cgccatttcg ccgcgcagaa gttggaagga tgtaccggag 15120 cctgcgaccgccgacgtcga ttcctcctcc gcctccctct ggtccctcgc cttatccggc 15180 gatgatcaacggatatcccc cggatgtgcc ggtggggtca cctgccaacg gagatgcgga 15240 gctgttcgtgccgctccaga gggtgatgcc gcctacgggt ggacggaaca gcattagata 15300 ccggaattatgcgccgtgcc aaaacaccac caagtttttt tacgtagaca ataagctgag 15360 cgacttagacacctacaacg aggacgcgaa tcacagcaat tttaggacga cagtcattca 15420 taatcaggacttagacccgt caacggccgc cacagagacc attcagctcg acaataggtc 15480 gtgttggggcggagagctaa aaacagcggt gaaaaccaat tgcccgaaca tcagctcgtt 15540 tttccaaagtgatacagtgc gcgtgcgtct gatgagcaag cgcgatccgg ggggtaccga 15600 cccagacgcgggggtgaaca acccacccgg ggccgagtac aagtggtatg atctgaggat 15660 tcccgaaggtaactacgcgt tgaacgagat cattgacctt ttgaacgaag gcatcgtcca 15720 gctgtacctgcaggaggggc gccaaaacaa tgtgctcaag agcgatatcg gggttaagtt 15780 cgatacgcggtatctggatt tgctgaagga ccccgtgacg gggctggtga cgcccggcac 15840 ctacgtttacaaaggatacc atcccgacat catcctcctc cccggctgcg cggtcgactt 15900 tacgttcagcaggcttagtc ttctgctcgg tatcgcgaag cgcgagccct actcgaaggg 15960 gtttacgattacttacgaag atcttcaagg agggaacgtg cccgcgctgc tcgatctgtc 16020 ctccgtgcaggtagacgatc aagacgagga cgtgatcgtg gtggcagacg caaggcctct 16080 tttaaaagactccaagggcg tttcctataa cgtgatcacc actggcgtga ctcaaccgca 16140 aaccgcttatcggtcttggc tccttgccta ccacaccctg gactcccccg cgcgcaataa 16200 aacgttattgactgttccgg atatggcagg tggtatcggc gctatgtaca catcgatgcc 16260 ggacacgtttaccgcacctg ccggatttaa ggaagacaat acgaccaacc tttgtcctgt 16320 ggtggccatgaacctgttcc cgagtttcaa taaggtattt taccagggcg cgtccgccta 16380 cgtgcagcgcttagaaaatg ccacgcaatc cgcaacggcc gctttcaacc ggtttcccga 16440 aaacgaaattctaaagcagg ccccacccat gaatgtttcc tcggtgtgtg ataaccaacc 16500 cgccgtcgttcagcagggtg tgctaccgct gaagaattct ctgtctggcc tacagcgcgt 16560 gttgatcaccgacgaccggc gccgtcccat tccatacgtg tacaaaacca tcgccaccgt 16620 gcaaccgcgcgttttgagca gttcaaccct gcagtgagga gcggaaggat tttcaaacat 16680 gtccattctgatttcaccca gtgataacag aggttgggga gcaaacatgc gttaccgccg 16740 tagagcatccatgcgcgggg tcggtcgccg tcgtctcacc ctgaggcagc tattgggtct 16800 ggggtctcgccggagacggc gatccaggcc cacgaccgtc agtaaccgtt tggtggttgt 16860 gagcacccgccgccgctctt cccgaagacg ccgatgaagc aagcagctga tgagatgttc 16920 ttctgactatgtgtgccgtc gctatacaca ggagcgacgt cgttatgcct tccgttcttt 16980 tgaccggcgggcggaccgcc aagggcaaga agagagcctc tcgtcgtcga gtgaaagtgc 17040 ctaagttgcctaagggagcg cgccgaaagc gtgcgtcggt gacgccggtc cctaccgtag 17100 ctaccgcgaccgcttccgag cgcgcggctc tgacgaacct agccagacgg ctccagcgcg 17160 gcgactacgccgcttggagg cccgccgact acacgtcacc ggccgtttcc gaggcggctc 17220 gcgcagccgcctcgtccggc acccccgcga ccgcgaggga tctcgcgacg ggaaccctcg 17280 ctcgcgccgtgcccatgacg ggtaccggcg gaaggcggcg caagcgcacc gctacccgcc 17340 gccgatctctgaaggggggc ttcctgccgg ctctgatacc tatcattgcg gccgctatcg 17400 gcgccattccgggcatcgca ggcaccgccg tgggcatcgc caatctgaag gagcagcaga 17460 gacagtttaataagatttac ggggacaaaa agtgatgctg actggacgca ctaaaaggcc 17520 ctttcaataaacgcgttttt gtagaaccgg ctcgcgtcat ggactacgct gcgctatcac 17580 cgcatctcggtgggtgggcc ctgagagacc accacatcgg cgactctagc ttgagagggg 17640 gagccatcaactggggcaac ctcgggtcgc gcataaccag cgcgctgaac tccaccggtc 17700 gctggctgtataacaccggc aaccgcttcg tgcattcgaa cactttcaac cagattaaac 17760 aaggcatacaagacagcggg gtcatacgca acgtggctaa tttggccgga gagacgctgg 17820 gggccctgaccgacatcggc cggttgaagt tgcaacagga tctggagaag ctgcggcgta 17880 aagctttgggggaggaaggt ccagcgaccc aggccgaact gcaggctctc attcaggccc 17940 tgcaggcgcaagtggctgcc ggagagccgc ccgccgcacc cgcggcgccg gcgccggccc 18000 cgccgctcgtgcccaccact cgtcctattc ccgaaatggt aacggaggtt aagcctcccg 18060 ttacgtcttcggcgccagcc gtccccgtag acgtgccgac cacgctggaa atgcgacctc 18120 cgccgcccaagcgcaggcgc aagagggcac gaccgggaca atggagggca cgcttggaca 18180 gcctctcgggtaccggagta gcgaccgcca ctagacgtat gtgttactaa aattccgtcg 18240 ttccgctatgtctaattttt agctcaccgg ttgtctcccg aaggcgtcat gactgcgctt 18300 actcccgacctgaccacggc gacgccgcgg ctgcagtact ttcatatcgc gggccctggc 18360 acccgagagtatctatccga ggatctccag cagtttatct cggccacggg gagctacttt 18420 gacttgaaaaacaaattcag gcagacggtc gtagctccca ctcgcaatgt caccaccgaa 18480 aaggcacaacgtctgcagat cagattctac ccgatccaga cggatgacac gccaaacagc 18540 tatcgcgtgcgctacagcgt caacgttggg gacagctggg tgttggacat gggggcgacc 18600 tacttcgacataaagggtgt gctggaccgc ggaccttcct tcaagccgta cggcggaacg 18660 gcttataatccccttgcgcc aagagaagct attttcaaca cctgggtgga gagcactggt 18720 cctcagaccaatgtggtggg acagatgacc aacgtgtaca caaatcagac caggaacgac 18780 aagacggccacgcttcagca ggtcaatagc atctccgggg tggttcccaa cgtcaacctg 18840 ggacccggcctcagtcaact agcatcccgg gccgacgtgg ataatattgg cgtggtggga 18900 cgtttcgccaaggtagactc agcgggcgtg aagcaggcgt acggagccta tgtcaagccc 18960 gtgaaggacgacgggtctca gtctctgaac cagaccgcgt actggctgat ggacaacgga 19020 ggtaccaactatctgggtgc cctggctgtg gaagactaca ctcagaccct gagttacccc 19080 gataccgtgctcgtgacccc tcccaccgct taccagcaag tcaactccgg caccatgcgg 19140 gcatgcaggcccaactacat cggcttccga gacaacttta tcaacctact gtaccacgac 19200 tcgggcgtctgcagcggaac gctcaactcc gagcgctccg gcatgaacgt ggtcgtggaa 19260 ctccaggacagaaacacaga actgagttac cagtacatgc tggcggacat gatgtcccgt 19320 catcactacttcgcgctgtg gaaccaggcc gtcgaccagt acgaccacga cgtgcgcgtc 19380 ttcaacaacgacggctacga agagggcgtg cctacttacg ccttcctgcc cgacgggcac 19440 ggggcgggcgaagacaacgg tcccgacctc agcaatgtca aaatttacac caacggacag 19500 caagataagggcaacgtggt ggccggaacg gtttccacac agctcaattt cggtaccatt 19560 ccctcctacgagatcgacat tgctgctgcc accaggcgca acttcatcat gagcaacatt 19620 gccgactacctgcccgacaa atacaagttt agcattcgcg gtttcgaccc tgttacagac 19680 aacatcgaccctaccaccta cttttacatg aatcgcaggg ttcccttgac caacgtggta 19740 gacctgtttaccaacattgg tgccagatgg tccgtggacc agatggacaa cgtcaatccc 19800 ttcaaccaccaccgtaactg ggggttgaag tacaggtctc agctgctcgg aaacagcaga 19860 tactgccgtttccatattca ggtgccgcag aaatactttg ccatcaagaa tctgctcctg 19920 ttgcccggcacctacactta cgagtgggtc ctcagaaagg atcccaacat gattctgcag 19980 tccagccttggcaacgactt gcgcgcggac ggcgcgcaga tcgtgtatac cgaggtgaac 20040 cttatggccaatttcatgcc catggaccac aataccagca accagctgga gctgatgttg 20100 cgcaacgctaccaacgacca gaccttcgcg gactacttgg gcgccaagaa cgctctctac 20160 aacgttccggccggctccac gctgctgacc atcaatattc ccgccagaac atgggagggt 20220 atgcggggctggtcttttac ccgcctcaag gcctcggaga cgccccagct gggcgctcag 20280 tacgacgtcggtttcaagta ttcaggctcc attccctatt cggatggcac cttttacctg 20340 tcccacacgttccgcagtat gagcgtgttg tttgatacct ctatcaactg gcctggcaac 20400 gaccgtctgctcacacctaa cctgttcgag atcaagaggc cagtggccac cgacagcgaa 20460 ggcttcactatgtcgcagtg cgacatgacc aaggactggt tcctcgtgca gatggccacc 20520 aactacaactacgtgtacaa cggttatagg ttctggcctg acagacacta cttccactat 20580 gacttcctacgcaacttcga ccccatgtcg cgtcagggcc ccaacttcct ggacaccacg 20640 ctgtacgacctggtgtccag cactcccgtt gttaacgaca ccggctcaca gccgtctcag 20700 gacaacgtgcgtaacaactc cggctttatc gcccctcgca gctggcccgt atggaccgca 20760 cagcagggcgaagcctggcc cgctaactgg ccgtacccgc tgatcgggaa cgacgccatc 20820 agttccaaccaaaccgtcaa ctacaagaag ttcctgtgcg ataactacct ctggaccgtg 20880 ccgttcagctcggactttat gtatatggga gagctgaccg atctgggtca gaaccccatg 20940 tacacaaacaactcccatag catggttatc aactttgagt tggaccccat ggatgagaat 21000 acttacgtgtacatgctgta cggggtattt gataccgttc gcgtgaacca gcccgagcgt 21060 aacgtgctagccatggctta cttccgtacg cctttcgcca caggcaacgc tgtgtaaaaa 21120 aaagacggctgggatgtcgg gaaccaccga gacccaactg cgggacctgc tgtcctctat 21180 gcacctgcggcaccgcttcc tgggtgtttt tgacaaaagt ttcccaggat ttctcgatcc 21240 gcacgtgcccgcctcagcta tcgtcaacac cggctcccgg gcctccggag gtatgcactg 21300 gatcgggttcgcgttcgacc ctgccgcagg acgatgttac atgtttgacc ctttcgggtg 21360 gtcagaccagaagctgtggg agttatacag agtcaagtac aacgctttca tgcgtcggac 21420 cggcttacggcagcccgatc gctgttttac cctggtccgt tctaccgagg ccgtgcagtg 21480 cccctgctcggccgcttgtg ggctttttag tgcccttttt atcgtctctt tcgaccgtta 21540 ccggtcgaagcccatggatg gcaatcccgt gatcgacacc gtagtcggtg tgaagcacga 21600 aaatatgaattctccgccct accgcgacat cctgcaccgt aaccaagagc gcacctatta 21660 ctggtggaccaagaatagcg cctattttcg tgctcatcaa gaggaactcc gacgagaaac 21720 ggcccttaacgccctacctg aaaatcacgt ttaatgaccg actgtaaata aagaacgacg 21780 cacacacgtactgtacatat ttgtgaatag agcaaccgtt tattagataa acgtcaataa 21840 atgccgaccgatagaccgac aaggctcttc actggcttta tttaaagaaa caaaaggatt 21900 aagcgaacgggtcgtcactg gcgatgggcg agactggcgc caacacctcc gttttaaagg 21960 cgtacgactcgttccaacgg aactcaggca catgtgtggg ctctgaagaa cccatcacgg 22020 cagtaaacagctccgtagca aagacgtacg cgtagcgcag atccatggcc gacagacgcc 22080 aggcgcaggttttttcggtc ttcttcagac cgcggcctgc tccgcccgac gccgcctgcg 22140 ggttgcagcaggtgaacacc atggtatgcg ggtttttctt gtgagccttc atatcgggcc 22200 tgctctcgaccatgtcgcga gtaatgtcgt ctgtgccgtt gagcttataa ggagtcattt 22260 tacagaactgtctccctgaa atggctcgat cggaggcgta gttgcagttg caattggttg 22320 agatgaggacacattcctct gcccggcgct tgtccgcgtt ggggtaaagc gccttcgtcc 22380 agtctatgtcgtgacgcatc gcgctgaccg ccttggcggc gtcggaaaag accatggcgc 22440 aactgccgtgcgcgtgggga tgagggaagc cgctgtgctc ctgatccttg tagcacaccg 22500 cgttcgcgtcgaacctgagc accaccacct gtcgtccaaa ccggttcttc tcgattacgc 22560 cgttctgttcggccagagcc ctctttcccg cctcgctaga cgggttcaac tccacggtac 22620 gaggtttcgtcaccatgtcc acaccgtgca tgcatttcgg gaagggctcc tggagcgctg 22680 ggaaccagccatggcgccaa acgtgcgccc ctccgggcac gaacttgggc tccagacccg 22740 cgcggctgtagataacggct gccaggaacc gccctacctg agcgttgaaa gagtcgtagc 22800 tagaaaaggtcaggcgaaat tcgggatgct ttttgcgaac gtatgtaccc cccattttgg 22860 tccaaatggagtcgtcgggg cgcacgctgg ctccctgcca ctgtaggtcg agagcttcgc 22920 agatgctggcgacggtggcc atggcgcgtt gcgcgccgta gaccacgggg tctgacagag 22980 gggcctccggggattcctcg tcgctggcgt tttcttcgtc atcgacaacg gtttcccgcc 23040 ggcgggtaactcgccttacg ggggatgggg actcctcgcg gcggctgacc ttcttgcgag 23100 tcgcgcctcggcccggggcg accacttcca cttcttcctc ctcctcttcc atcatgactt 23160 ctgccgttctcttgacaggc ttggtgctgc gaaagccatg agctcttttc ggggttcttt 23220 ccatgacttctgcttcggtg acgggatctc gcgtttcaaa aagttcttgc tctccctcct 23280 cttcagagtcagggactact gccggagagg gtggaagcgt cttttgaagc ttcctgggac 23340 ctatagggtaaagttaacgc ccatcgtcag cgagaccacg cctcgctggc cgatgggatc 23400 acgagacacgataaaagacc gcgaccaaaa cactcttggg gctagtatcc ctacccgggt 23460 gcgagcgtggcagatcttcg ctcttctgct tctccagtgg attctcgggg tctttcggcc 23520 ccgtcggtctctggggtggg agaggcctgc tcctccctct gtttgacttg attaccgtcg 23580 acggcccgggctcttcgagg tccacgaagt ccgccacgtc ttcgtcgctg ctgatcgtct 23640 ctgggtgaagcgtttctgcc atcgtggctg tcatcgaaaa tggcagacaa gattacccga 23700 gaggaaaaaaccatagcgac gctggacctc gtgttacgcg tggtcgtcga tgctggtaac 23760 tgggacgtgttctcgaaacg tttggttcgc tacacacgcg aacagtacgg aatcgagctg 23820 cccgaagatatcggggactt accggacaca tctgaggtct cgaaagtgct gttgagtcat 23880 ttgggggaagacaaggcggt actgtccgcg taccgaatcg cggaactgac gcaaccttcc 23940 gaaatggaccgcgctaaggt cacagaggga ggcctggccg tacttaacgc gagtcgcgat 24000 gaaagcgaagctcagaaccc ctcgaacccc gaacccgaga gcatcgagag cgacgccgta 24060 gaggatctcggcgttgcagc agagagcgac cctagcgatg acgaacccga cccagaaccc 24120 gagtatgaccatcgagaggc ggatcatgac tctgatgcgg atagcggata ctattcggca 24180 gatgggggacgacctggaac accagtggac gaggagcccc aggacgattc tccctcttcc 24240 gaggagaccgcatccactgt catcgaagaa gcgcagacta gcgctagcaa cgattctcat 24300 gacgacgacactcaccgcga cgacggcagt gcttctgaag aggatctcga gcgggacgcc 24360 ctcgtggccccggccgatcc ttttcccaac ttgcggaagt gtttcgagcg ccaagccatg 24420 atgctgaccggggcgttaaa agacgccgcg gacacggctg atccgccaga aacgctctcc 24480 gtcgacagcgtgcaaaggca gctcgaacgc ttcgtcttta accccgaccg ccgcgtgccc 24540 gccgaacacttggaggtacg ctacaatttc taccctcctt tcctcacccc caaggccatc 24600 gcgagctatcacatctttgc cgtcaccgct tccatccctc taagctgcaa agccaaccgc 24660 agcggcagcgaccttctagc caaagcaaaa gagagcactt tcttcaaacg cttacctaaa 24720 tggcgtctcgggatagagat cgacgacggg ttgggaacgg aagtcacggc ggtaacagag 24780 ctggaagaggcaaaaatggt tccgttaaag gacgacgtgt ctcgtctgca gtgggcaaaa 24840 atgcgcggcgagcacattcg cttcttcagc tacccgtcgc tgcacatgcc tcccaaaatt 24900 tcccgcatgctgatggaaac gctgttgcaa ccgttcgcgg acgaaaacca aaaggcggaa 24960 gaggcacttccctgtctgtc ggacgaggaa gtgctggcca tcgtggaccc gacagggcgc 25020 ctccacggcgaggacgcgct caaggccgtg gaaaagcgga gggccgcggt cactatggcg 25080 gtacgctacaccgcgaccct cgaactcatg gaacgcgtgt tccgcgaacc gtctatggtc 25140 aaaaagatgcaggaggtcct ccaccatacc ttccaccacg gcttcgtcgc cctggtacgc 25200 gaaaccgcaaaagtcaacct gagcaactat gcgaccttcc atgggcttac ctacaacaac 25260 cccctgaacaactgcatcat gtccaagctc ctagaaggag cagacaagga ggactatgtg 25320 gtggactcgatctacctttt cttggtcctg acgtggcaaa cggctatggg tatgtggcag 25380 caggccatagacgatatgac tatccagatg tacaccgagg tctttaccaa gaataagtac 25440 aggctgtactcgctgcccaa cccgaccgcc atcggcaagg ccatcgtgga catcctcatg 25500 gactacgaccggctcaccga ggaaatgcgg aaagcgctgc ccaacttcac ctgtcagagc 25560 cagattactgccttccgcca ttttctactg gaacggtcca acatcccagc ggtcgccgcg 25620 cctttcatgccaagcgactt tgtgcctctg gcttacaagc agagccctcc cctcctctgg 25680 gaccaggtctatctgctgca gctggccttc tatctcacta agcacggagg ctacctgtgg 25740 gaagccccggaggaagaggc caacaacccg tccaaccgga cttactgtcc ttgcaatctc 25800 tgcagtccgcaccggatgcc aggtcacaat gcggcattgc acaacgagat tctggctatc 25860 ggaacgttcgagatccgcag tccggacggg aagaccttca agctcacgcc tgagctgtgg 25920 accaacgcatacctcgacaa atttgacgcc gaggacttcc acccgttcac ggtgttccac 25980 tatcccgagaacgcatcgcg gttcgcatcc actctaaaag catgcgtcac gcagagcccc 26040 gaaatcttgagcctgattcg ccagattcag gaatcgaggg aggagtttct gctcaccaag 26100 ggcaagggggtgtacaaaga cccgaacacc ggagaaacca tctccagaca gccccgggac 26160 actgcccgcgcgcagcacgc tggagacggt caagctctac cagcccctgg agcctatacc 26220 accggaggaaatagagcgga gacagcgcct gctggagctg tacggcttgc cccggactac 26280 caagacgggcagtttcctat cgcgaaagtc ggcccgcact accatggccc aaagaatgtt 26340 agacgagaagaccagggtta cagaggcggg cccggaggtg tacggggaga gcgcgaggtc 26400 gtcctttcacgaagagcagg aggaagacgt ttcggacgga gaaacactag gcagtcagga 26460 tacaacgaacgggctaaccg atatttcgga agaggaggag gaggatctgt tcgagggcaa 26520 caaggagaacatcccaccac ctcgccgtcc gcctcggaac cgccggctcc gagccgcata 26580 ctcgctcgaggaacccctcc ttcccccgag cgccgcgacc gacaagaaga gtaagaaagt 26640 cccaaaaaggcgaggtaaat atcgcagctg ggctaagcac cgcgtggcga tatgccaggc 26700 acttcgcgatgcggtctttg accgcaaaaa ggcgggcgaa atcctcaagc ggggtcaccg 26760 gctcttcgtgcccgctactg tcataggcta ctatgctcgc aaactctctc cctcatttct 26820 cgctcctctctccagccaca ccgcacccct cctcccacca aaaaaacacc ggcgctaagg 26880 ctgtgcgtctgcgccaagat ccggtgccgc agcacatcgc ggacctcaga ggggaaatac 26940 tcgacatcctgttggaaatc gagtcgtacg cccgccgccg tcccgaccgc cacgtgtcca 27000 ttcgcaacagaacgcgcgaa agcatcaccc gaaaactgca ttacgagaaa aatgaagata 27060 agcttacccgtatgaagagc gatgctatca agttgctcgc tctctggcag accgtttaac 27120 tcgtgttcctttatagccct tcggaaccat gaacctgatg aacgccacac ccaccgaata 27180 cgtatggaagtacaacccag tctccggcat tcccgccggc gcgcaacaga attacggcgc 27240 cactatagactgggtgttgc caggaggaac cggtttcgca atagcaacca acgacattcg 27300 aagacaaacccttaacccgg ccgtgacccg tgcaattacc gcgcgttttg aagctgagtc 27360 agaccagcaaccgtacgcta gccctcacga gaccaatgtt atcgcggcca atgtcctcga 27420 ctcgggttatcccaaatccg gtctctaccc attagagctc agcggcaatc agcgcgtaca 27480 gctggcaggcggcctaatgg taggtcgcac tgagggcagg atgcaattag cgggcggttt 27540 aacagaaggaagagtgcaac tttctggagg tttccacgga cggccgttgg ttagagggcg 27600 gagcagaagaccgcccagat ggtgcggcgc cgaactgact gggaacggac tgcccgagca 27660 agccgaagtcacttctgaca cttacaagta cttcctgaga acacagggtc ccagccaagt 27720 ggttgaagagcccggcgtct tttcgcaaag acaatttatg actaccttcc tcccctccgt 27780 tgtccctcatcccttcgaca gcaccaaccc cggcgatttc cccgcgcagt acagtgccat 27840 ctacaaaggccgcacggcct tcgaagacac cttttgggac tggtgaagcg caccttttgt 27900 tggcgatgctccgtttcgca ataaatttct tccaattctc tgtcgttaaa cggctcccgt 27960 ctggtcactgtcacgcgctc gccgccctcg ctcgtcaccc gcgcgcggta ccgtcgcctc 28020 agccagaatacaaaaccggg gttcaggggt tcgtcgaacc gtaccacagc ctggtcgttt 28080 aatctcaaccaatattttct agggttcgac atcatgaacg aggaggttcc cctaaagcgt 28140 gtcagccctgacgaaaccga gacggttccc aaaaaaccgc gaaccgacgt tcgcgacacc 28200 gtcagggccggcactgacga cacggtagat ctcgtgtacc ctttttggtg gaatctcgga 28260 acgggagggggcggaggagg aggaggcggg ggcggcggca gtggaacctc tctccagccc 28320 aatgacccgctttacgccgc cagcgggacc atcaacctac gcatgacatc cccgctaacg 28380 ttgtcacaacgagccttggc tctcaaaacg gacagcaccc tcaccctcaa cacgcaaggc 28440 cagctgggcgtcagcctcac ccccggagac gggctcgtcc tcaacaccaa cgggctcagc 28500 atcaacgcagacccgcaaac cctcgcattc aacaacagcg gggcgctcga agtcaaccta 28560 gaccccgacggaccctggtc taaaaccgcc acggggatcg atctgcgtct agatccgacg 28620 acgctcgaagtagacaattg ggaactagga gtcaagctcg atcccgacga agccatcgat 28680 tccgggcccgacggtctctg cctcaacctg gacgagactc tgctgctcgc caccaacagc 28740 acatccggcaaaacggagct cggggtacac ctcaacacca gcggtcccat tactgcggac 28800 gaccagggcatcgacctgga cgtcgatccc aacaccatgc aggtgaacac aggaccttcc 28860 ggaggcatgctggccgtcaa actcaaatct ggcggcgggc tcaccgctga ccccgacggt 28920 atctcggtcacggccaccgt cgcgcctccg tccatcagtg cgacagctcc tctcacctac 28980 accagcggcaccattgcact cactacggat acgcaaacga tgcaagtcaa cagcaaccaa 29040 ctggccgtgaagctcaaaac gggaggcggt ctgacggctg acgcggacgg aatctccgtt 29100 tcggttgcaccgaccccgac gatcagcgct tctcccccgc taacctacac caacgggcaa 29160 atagggctctctatcggaga ccaaagcctc caagtcagct ctggacagct ccaagtcaaa 29220 ctgaaaagccagggcggtat tcaacagagc acgcaggggc tgggagtggc ggttgatcaa 29280 acccttaagattgtgtctaa cacgctcgag gtcaacacgg acccgagcgg acccctcacc 29340 agcggcaacaacggtctcag cttagcggcc gtcacacccc tagcagtgtc ttccgccggc 29400 gtcaccctgaactatcagtc ccctcttaca gtcacgagta actctctcgg gctctccata 29460 gccgcgccactccaggcggg tgcgcaaggc ttgacggtaa acacgatgga acccttgagc 29520 gcctcggcgcagggcatcca gctgcactac ggacagggat ttcaggtcgt cgcgggcacg 29580 ctgcagctgctcactaatcc ccccatcgtt gtctcatccc gcgggttcac cttactctac 29640 actcccgccttcacggtgag caacaatatg ttggggttga atgtagacgg cactgactgc 29700 gtggctatcagttcagccgg cctacagatc cgtaaggaag ccccgctgta cgtgacctcg 29760 ggaagcactccagcattagc ccttaagtac agctccgact ttaccattac caatggtgcg 29820 ctcgcgttagcgaacagcgg cggaggagga agttccacac ccgaggtggc cacctatcac 29880 tgcggggataacctactcga gtcctacgac atcttcgcct ctctacccaa caccaacgcg 29940 gctaaggtggcggcttactg ccgtttagct gctgcaggtg gcgtggtcag cgggaccatt 30000 caagtgacaagctatgccgg acgatggcct aaagtgggca acagcgttac ggacggcatc 30060 aaatttgccatcgtcgtgtc tccccccatg gacaaagacc cacgatcgaa cctcagtcag 30120 tggctgggtgccaccgtatt ccctgcgggc gcgactactg ctctcttctc acccaacccg 30180 tacggctccctcaacaccat caccacactg ccatccatcg cctcggactg gtacgtgccc 30240 gagtccaacctggtcacgta taccaagatc cattttaaac caacggggtc gcagcagctg 30300 cagctcgcgagcggagaact cgttgttgca gcggcgaaat cgcccgtgca gacgacgaaa 30360 tacgaattgatctatctggg atttacgctt aagcagaact cctcgggtac caacttcttc 30420 gatcccaatgcctcctccga tctatccttt ctgacaccac cgattccgtt tacttatctg 30480 gggtactatcaatgaacttg ttaactcctg cagcagcagc agcagcagca gcagcatggc 30540 tgaccagaaaaggaagctgg cggatccgga tgccgaggct ccgacgggca agatggcccg 30600 cgcgggtccgggagaactgg acctcgtcta ccctttctgg taccaagtag ccgctcccac 30660 ggaaatcacacctccgttct tggacccgaa cggtcccctg tactccacgg acggcttgtt 30720 gaacgtcaggctcacggcac ccctcgttat catccgtcaa tctaacggca acgcgatcgg 30780 ggtcaagaccgacggaagca ttaccgtcaa tgcggacggc gcgctgcaga tcggaatcag 30840 cacggccggacctctcacca ctaccgccaa cggcatcgat cttaatatcg atcccaaaac 30900 cctggtcgttgacggtagca gcggcaagaa cgtcttggga gtgcttctga aaggacaggg 30960 ggcgctacagagcagcgcgc aaggcatagg cgttgccgtc gacgagtctc tacaaatcgt 31020 cgataacaccttggaagtga aggtagatgc tgcaggtccg ctcgccgtca cagcagccgg 31080 cgtagggttgcagtacgaca acacccaatt taaagtcacg aatgggactt tgcaactgta 31140 ccaagcgcccactagcagcg tggccgcatt tacatccggg acgatcggct tgtcctcccc 31200 tacgggcaattttgtgagct ctagcaacaa cccgtttaac gggagctact tcctgcagca 31260 gatcaataccatgggcatgc tgactacctc gctctacgtc aaagtcgaca caaccaccat 31320 gggtacgcgtcccacgggcg cggtaaacga gaacgcgcga tactttaccg tctgggtgag 31380 ctccttcctcacgcagtgca acccctcgaa catcggtcaa gggaccctag agccaagcaa 31440 catcagtatgacctcttttg aacccgccag aaaccccatc tcacctcccg tgttcaatat 31500 gaaccaaaacataccctact acgcttcccg attcggggta ctggagtctt accggcctat 31560 cttcaccggctcgctcaaca cgggaagtat cgacgtacgg atgcaagtga cgcccgtcct 31620 cgccaccaacaacacgacct acaatctcat cgcctttacc ttccaatgcg ccagtgccgg 31680 actgttcaatcccaccgtga acggcaccgt ggccatcgga ccggtggtgc atacctgtcc 31740 cgctgcccgcgcccccgtta cggtctgaac aataaagaca aggtgaacca tttatacagt 31800 ctcacgtctctttattgcat acgctccgct aaatgtttcc attcgctcat ttgccagtaa 31860 tacagcagattcgcaaactc actgaaccaa tcttctgtat aaaaatgtac gcgctgcgtg 31920 tccaaatcaacatcaatttt cctcatatac agacaggggc tgccacccgc ctcccccaag 31980 cgcgacaccgcaattaggaa tggtagcctg ctgtgcaggt ccacgtgaat taacatcccg 32040 cacacgttcccgatcggtcg ctgcataaat actggagaga aatcgctaaa ccccggtgac 32100 gcccacatagccacgaagta cacccctgcc acattcaagt catcctccaa cctggcccaa 32160 acataagtggccaaatcgga aggagccagg tggcaagccg ataaccccat acgatgcaaa 32220 ggtaacccgtggcaagcgca tcccccgaaa tgaagttcga aagaatcgta acacagtagc 32280 tgataggcatgaagcggcgt cggcatctga agaccgtcat catcttcgtc gtcttccatg 32340 tcatccccaacttcctcctc gcgctccgct tcctgttggc ggcgctgctg gtgctgcagc 32400 accatctccaggatctgctc gtcgttcatc ttaatccgga attatcgcgt acggatgttc 32460 ctcgtcgtccgaactgacaa cagaaggcgg aggagctgtc agtggtgctg tagaggctaa 32520 cgatgctgcagcaccggtct cttgcaattc gaaataccaa gggttgctac tgacggtcca 32580 gttcccgccccgtgaaccag gccagcggga aatcggtgca ggtaggggat ccggtgaagg 32640 agaccgggaatggagggaag gaactgcgag atccttatcc actcgataca aaccgtataa 32700 cagggagcccaacgccaggt acaccaggaa cgtactacaa acgaacacgc tgattacaaa 32760 gtttaacgaagacagatggt tctgtaggaa caggaagctg cacaggatga tgctgctgta 32820 ggacagggcgaccaggaggg ccaagaactc gcgccaatag cgtccacaac actgcaaaat 32880 caaacacgtaattagctata cggacgttca ccagcgactc tcgcgcgtcg ttccataaac 32940 acattgcgcagataagccaa ctgagcagaa cagaacagag agagaggttg ccgcgtcgaa 33000 cactgtttgcactgtccgaa acactcggga tgagactccc cgtacttccc gtgcacatga 33060 aatacccactcttcgacgtt agcgtaggac aaccgacggt aaccggggaa gtaacacagt 33120 ccctgcgacacgatcggcca cagttccggc gacaccatca cgcggagcat gcttctcagg 33180 cagacgggttgggtcacggc gctgctgcga gaaatagttt ccaccatgat ggtggccgtc 33240 acggtcacgcgacacgcatt catgagaaca accggagacc gcacaaaagg aacagacgaa 33300 ggcacttgcgaaaaggacac ctcaaagctc atcgagcgga gagccatctt tacggatatc 33360 ttctccgcaatcagaaagcc tgtggtgaaa aacttaaaat ctgtcttcct gcgagcaagc 33420 atccacggttccaagtcgta cttcatccca ggtgtaataa aaagcaacct acgtgagagg 33480 tagtgcagcagggcggtctt ctctgcccgg ttcaggtgaa gggcactcac aatccggact 33540 atgcaattcatgagtacgta gtctcgccgt ttgaaacaca caaactttgc gctgcgtacc 33600 gtcactagcaccgtgacaga actatgcaat cgcctcatca gatcattatc gaaacaccgc 33660 aagctagccacggctctatt tatgcggtac tcgttcagtc tctccatttc ctcctgtcga 33720 caagttggatcgtgacgtag gagaaaacta aaccatatta ctcctacttc gttatgaaag 33780 ccacaagctctgctgacggt taaagactcc ttaagaaaga aaaggtagta cagtcaagct 33840 gacccatacaggtgaacccg ccccaagtca cgtaagtcaa ctcaccgaaa gaggacacag 33900 agccatatccgctgcttcaa agctttattg acgggtctaa aggcgtaaag aaaagaagaa 33960 tttaccgttctgcatctcaa accccaccac cacgcgaaaa agtccgaacg atgctgcagc 34020 accgttcacgcaaaagtccc ggacgcgcac aaataaaccc ttaatcccga taacggtgct 34080 gcagcaccgtcacgctgctg cagcatcgtc agacgtttat gacatgcagc cgattccgtg 34140 cggatttatggtcccataac ggttccaggt cctttcgttg ctgcagcacc gtttcacgta 34200 aagaagctgtacaggtcaaa ctggtccgga ttactgatta ttcgggggag agccacgtga 34260 cgtagactcgaacgacgtcc acttccgata caaaccacat ctctctggac acaggcttgt 34320 ccgccttgagccaaaccatg tgattcttcc ggtccgttct gacgtcaatg ccgacacgcc 34380 tcttggtgtttgaacaggca tcccagtacg tagccaggac cactcggtga cgtcgaggtt 34440 gaggtttaaaggtcactacg gcctgtaacc cggtactcca gggccctaaa ctaacatatt 34500 ctccggccctgcccactaca ttcgggtgat caaatatgac ataatcctta aacaatttgg 34560 ggaacttcaaacagccacat ttgggcggga caggaagatg gtgcgcagaa acattgatat 34620 gagagcgccatctagggaca tcaaagcggt gcccgcctgg atggcaatcg tacttcttag 34680 ttccggtaaagtaataggtg tgagtccgga aacacgtaaa ctccgtcact tcctgtgtcg 34740 tcattgccctcgcccctagt gacgtcagag tgccacgccc cttgtacagt ctaataaatt 34800 ttaatacacccccgccccta gcatataaaa caatgggagc ctcgcccaca ttcctatccc 34860 taataaaataccatctgacc gaatacccgt gttccatccc tattgtttta gtataataag 34920 ggtcataagtccacaactcc gcggcattgc cctctgtcac caccaacagc aaataggaag 34980 atgccatgtcatcctctcgt aaaagcatcc tccaatcagc tacctcttcc gtgtagtact 35040 gagttggctgactgtaatca cgccccgtga cgtaggctga ccacgcggaa ggaacacttc 35100 cgtgcatgctcagtagctgg tcagcctggt gagtatccag ggccttcata aaggtcaaag 35160 gcgccataatgtgataacac agtccatccg atcggaggaa atcaaaggga atatgtacat 35220 cttcacaatggcgcctccca gaataagacc atgcacagat ttgacctttc caccaagcac 35280 gtgactcgcaagcctcatgt ttttgaccag tcagatcgct ccgtatatac tcgtctccta 35340 ttggtcgatcaaaaccgtac gaaccaatac tagacgcaat caccactacg taatccgcgt 35400 catccctagagataacacta tccgcatccc tattggctga ccaatgacca ccggaagaca 35460 gcaaaaggtttaaccctttt gtgtgtaagt ccatcctata accctggaaa ctttccaatg 35520 ggggatctagcgccactatg cggccaacct tttttgaccc tctgtccgga ctagaagttg 35580 gcgggacaaagccgcgcata cagtgccccc tagcgacatc cctatgcaat gaattcgatg 35640 gtccttgaactccgtaaaaa aatgagcagt ggtcctgact gcgtaatagg ccggccccct 35700 cacatcctgcccccacaaaa gggcgtctac cttcttacaa atatctctca gctgattggt 35760 ccagtccaacagaatgaccg gggactctgg cgtcataatg gtatgcatac gcaaaatctt 35820 tctcatcatttcactggtcc atttatatgt gccgtcatag cgcgccctat taataagcgg 35880 acacacatcgggatacatgt cctgaaccag aataatgagc tccccgctat ctttaccatc 35940 caataccccgagccgccgca tttgactgac aacccagggg ccgtccgaaa aagtcaaaaa 36000 agtctcattccaccatacaa ttaacttggg ccacgaagga aaatccggcg aataggtgcc 36060 catcaagcgcctgacgtcag ccttaggata tggcggatcc catctggaat ccgacccatt 36120 aaagcacgaagatagggcag acatcggcca atggccagga gaatgggtag aattaataag 36180 gactccgcctccatttccga gcttttaaaa aaaagagaaa atggaaatca gccaagagac 36240 caccaccccgtgattggatg attggtcatc agaagatcga taagggaatt tattttctgg 36300 gagccccccccccccctact cctatttaaa aaataaccct ttcctcacca agctcagaag 36360 acagaggaggagagtagagc gccgctcaga ggtcatccgc cgagagaaaa tcccgcgcag 36420 agaacagagctctcaggtag gggtctggag ctctctggaa aaatcgcggg ctcttataca 36480 cttactctccgcccattcga aagccgcgcc tgactagagt acacactata aattccattc 36540 cggtgacttactactaggcg ctggccactt atcaaaagaa acagttctaa gaataggaca 36600 aagtccaaccgcaataaaac acccttgtca aacatgataa gagtgttctc gagaaggtac 36660 tggaaaagcaaacagtccaa ctcccaagtt aaatattacg caaagaggcg taacgagaaa 36720 agactagaaagtgtaaacac acctctccta gttatatata aacccagcgg ggcagtccct 36780 agaagaacactacctcaatc cagttacaca ttaacccggg aacctattat tgattaacta 36840 gacagtacttcctcattttc tactggaact ttccactgcc ctccggggat tttccattgg 36900 caatcattaacttgactttg tactttatgt ttactctcca tagcaacgca ccttatatgg 36960 aaaatatgctcctccccgga ccgcccatcg taccacctga gcaggtaggc tgtacctttt 37020 cctattggcccattatgagc tcacctggtt aatcatatac ccgctccgcc tatataggta 37080 gcataccgggacaggttccc tcacagtcta ttgcagactg ccgaagagag aggagctccg 37140 cataggactgggaccagaac cccgagactc tgccggtaat attttaattt catttaatcg 37200 aatcaaataaatcaaaaatc aactcaaacc catgattctc aatggaaatt tcttgtgatt 37260 ttctttcgcgcgcgaccacc ccctatggca cccccctgta cacccccctg tacacccccc 37320 tgtacaagggaacctacccc cctgtacagc gaccaccccc catggacacc cccctgtaca 37380 ttctacaggtatggcccgca acccattccg catgttccct ggggaccttc catactacat 37440 ggggaccatttcctttactt cggtggtccc tgtggaccct agccagcgga atcccaccac 37500 tagccttagagaaatggtga ccaccggcct gatttttaac cctaacctga ccggcgagca 37560 actgcgggaatactcattca gccccctagt gtccatgggg agaaaggcaa tcttcgcaga 37620 ctacgagggtccccagcgca ttatccacgt taccattagg gggcgctccg cggaacccaa 37680 gacccccagtgaggccctca ttatgatgga gaaggcggtc cgtggcgcgt tcgcggttcc 37740 tgattgggtggccagggaat actcggatcc cctcccccac ggcataaccc acgtggggga 37800 cctgggcttccccattggtt ccgtgcatgc cctgaagatg gcgctagaca cactgaagat 37860 ccatgtccctcgcggagtgg gggtccctgg ctatgagggt ctctgtggga ccaccaccat 37920 caaagccccccgacaatatc ggctcctgac cactggagtt ttcaccaaaa aagatctgaa 37980 aagaacacttccagaaccat tcttcagccg attttttaac caaactcccg aagtttgtgc 38040 catcaagactggcaaaaatc cgttttctac agaaatttgg tgtatgactc tcggcgggga 38100 tagccccgcccccgagagaa atgaacccag aaatccccat tctctccaag attgggcaag 38160 actgggtgtcatggaaacct gcctacgtat gagtaggcgg ggactcgggt ctcggcacca 38220 cccctaccattctctgtaac caatccctga ataaagattt gcataacaga actttgactc 38280 ctccttttatgtgggtgggg taatgggcgg cacttggggg taatggcggt tcctattgga 38340 tgggtaacaccgactccgcc ctacaaagtt aatgattgat ttttcggact tagaaaaatt 38400 tcgactgtcacctggatgtt tttccccact taacctctag ggggagatag atcgcgtcca 38460 aggggaggagctcaataccg gaccgcctat taggtgtggc ttcgggctcc gcctagtggg 38520 aggagacaggaaaaccacgc ctagtgacgc tgggtcaaag tccaagggga gtggtttatg 38580 cgcaccgccttggggcgtgg tttgggcggc gcaaggtaac ccttggactg ggaggagact 38640 tctgtcccttgggcgtgtca aacaggtaaa ccccacccgc gcgattaatg attaattttt 38700 cggacttagaaaattttcaa cctgatactt tattttcaag cttttcccgc cgacgggcaa 38760 gcctcctatctctccgtcta tgactccaca gagcctcatc tgaatatgta aatgtgctga 38820 accgcaaccccgtagaccgc gcccacccca gcatcaaagg taacgccccc gatgccacaa 38880 tgtaattacccactgttaaa ttaggatcct tacaccaatc atttctgtac aatttaaacc 38940 accgcccacgcgggactttc ccgtggtggt agaaaaaggt tttgaaaaac gcgcgcatta 39000 ttttcgtggcttcattaata gcggacatgc gcagatccag aaaggtcaga cacaccacca 39060 cggtcacatgacatttgcat ggagacaggg attggatacc gacacaatac gccatcggat 39120 actgaatgtccacattggtc cgtgcatata cagtgtgcca cctcctcttg accgcagcca 39180 cgaatccccagaagagttct tcccgccaca aaatcgaaac cggggcggcg gccccaaagc 39240 acaaatacaaaggcatcctc ctgaggctct agaaaaaaca actcattaac aggcatcccg 39300 ctcataggtacattcgtgta aacagggctg catgccaacc aatgccccgc gttttacaaa 39360 gtgacgggccaccctattgg cggagggggt ccacgtattg cgcaccgcgt aaatagaagc 39420 cacccctcgcggaacctgtg tacattcaaa tctcctccaa atacattcgc gcagtaaagc 39480 caccgcccttttcaagaaag tccaatcaac cttatgcgtg ggcaaaaaaa tagaagctga 39540 atatacccccgcaaactcct ccaatcggaa caggtaatct acactatagt gggacagcat 39600 ctcaacagttaaactttccc aggcatttat caccgtcaat ttcagatcat ggaatacggc 39660 caagttaggctccatcaagg tcacgcggag gtggaagtaa tacatcccga aataccctgt 39720 taaaaaaaatagaaaaatga actaaccgac aataagatcg gcagtaccca gtttcgatct 39780 ggggacctccggagtgcaag tccgacgctc ttacggctga gctacactgt cgatcttgat 39840 ccgctagggtacgcagtccg gagaagaaat atactaagtg agacccggtc ctatatatac 39900 aggttggttcaaaggaacct ttgtacccat taaaacaggt gcgtgactgt agaagccaca 39960 cccctacctgtaccgataag gcacaccctg agcaaacaaa ccataaaggt atacttcctt 40020 attcagacaggtataaatgg aacctccgca caacagtccg gtaccatttt ccatcgcgaa 40080 aatgggcaaccctactctgc tccttctttc aggtctcctt tctctgaccc aggccatttc 40140 catcggagaacacgaaaaca aaacccggca tgtgattgta tggcggcact cctcctccca 40200 ccaatgctctgattggagaa cagtcacgga atggttcccg ccccaaaaag gcaacccggt 40260 gagaccaccctacacccagc gggtttccct ggatacggca aacaataccc tcacggtaaa 40320 acccttcgagacaaacaacg ggtgttggga aactacgtca caaggcatta accatccacc 40380 aaccaccattcagtaccggg tatggaacat caccaccacg cccaccatac agacaatcaa 40440 cattaccaaaataactgttc gggaggggga ggactttacc ttatacggac ctgtgtccga 40500 aaccatgagtattatcgaat gggaattcat caaggatgtc acgccccagt tcatcctcca 40560 atactatctctccattaact ctactattgt gtacgcaagc taccaaggga gagttacctt 40620 taaccccggtaaaaacacac taaccttaaa aggcgcgaag accaccgaca gcggcaccta 40680 caagtccacggtgaacctcg accaggtatc cgtccacaac ttccgagtag gagtcacgcc 40740 catcgagaaaaaagaagaag ctaccgcaga gacacctgcc agcaagccca cgcccatacc 40800 acgtgtccgagcggatgctc gaagtactgc cctatgggtt ggacttgccc tttgcatcct 40860 gactgttatacccgccctta ttgggtggta cttcagagat aggctctgtg ttcccgatcc 40920 aatcattgaactggaaatcc ccggacaacc ccatgtaaca atacacatat tgaaaggtcc 40980 cgatgatgattgcgaaactt aatgattgac aaacgtaata aaaaagctgt gacgcacata 41040 agtacgtgtctgtgtcattc atccatacct atatatggtg atagcccctt cctcaataca 41100 caccgagccgcgatggaccc cagaccactt gttctgctcc tcctcctagc gtcccatata 41160 agtacattccggcaaatgta ctttgaaggg gaaaccatcc atttccctat gggcatatat 41220 ggaaatgagaccaccctcta tatgaatgac atcatcctgg aaggaacacg cgccaatacg 41280 accacccgtacaatcagcct cacgaccacc aagaagaatg cgggaactaa cctgtacact 41340 gtgatctccgaaacgggaca caacgccacc tatctgataa ctgtacaacc gctgggacaa 41400 tcgatacaccacgcctacac ttgggctgga aatactttta ccttacaagg acaggtattt 41460 gaacacggtaattatacacg atgggtgcgg ctggagaatg cggaaccgaa actcattatc 41520 agctgggcattgtccaacag aacaataaac aaaggaccgg cctatactgc aaacatggac 41580 tttgatcccggaaacaacac cctcactctc caccctgtgc tgataacaga tgccgggatt 41640 ttccaatgcgtcattgatca gcaaacaaac ctaaccctca ccataaactt tacagtctcc 41700 gagaatccaccaatcgtagc acacctggat atccataaaa ctatttctag aacaattgcc 41760 atttgtagctgtttgcttat cgcggtaatt gcggtcttgt gttgcctacg tcagctcaat 41820 gtaaacgggcggggaaattc cgaaatgata taaaacaata aagcagtgtg cgtcatggaa 41880 acttttctcaggtgcttcct cattcacaca ggtatatata gggaatggaa aattagacag 41940 atacccacaccggaacaatg ctacttctca cagtagttct gttggtgggg gtcaccctcg 42000 ctgcggaccatcctactcta tacgctccga aagggggcag tatagaattg ggtgtggggg 42060 ctaaacagaaagggcaatac aaatttgaat ggcggtttgg aaatctaaaa attgtgatag 42120 ccgaaatgtcatccactaac caattagaaa tcaaatttcc cgataacggt ttccaaaatc 42180 gatccgagtttaaccccacc aaacataact taaccattca taatgccagc tacgaggaca 42240 gcggaacctactcactccac caggaagaaa atgatggcac ggaacacacg gacaacttca 42300 aagtgattgttcaaggtatg tcattatata catatttaca atatgcatta atatcaccta 42360 tctaatagagcattaattat ccagacccga ttccacgccc tgaagtcaag ggaaccacta 42420 tgcaaatcaacgggaaaact ttcaccaata tatcctgcca tctaccggcc ggttcctacg 42480 gcaatgtctcctggcattgg aattataccg acccaatcat agtcgggtac gaaaatcaga 42540 gcatgctcgttggaccttta ggggtaatgt attcatgtac ggcatccaat caagtctcaa 42600 aaaactccagtgcaataagt atggacaccg ccgaaccatc agagagtaag tagcgccctc 42660 tatagacattatatagaata taactgaaca cattaagaaa cctctgtaat tatttatagg 42720 agcggagtgcgcatatacag gatacattgc gggcattata atcttaggag tgctttgcat 42780 attgttcatttacctatatg caaatacccc tgaagtgcgg cagagaataa ccgaccaatt 42840 agaaaagcttctcggaacat tctgtgacgt cagtatagaa gacggaatac cggaacgcac 42900 cagaagaaacaaaaaaagaa tcatcttaaa ggagccctcc cataggtgga tctggattca 42960 gtcatttgcataatatgctg tgtcataacc gccaccatca ccatagcaat cattgggcgt 43020 aagtattgtgacgtaagaaa aggcatgtct aaaaaaacgg tcactcacca taatgctgcc 43080 cccgataggttgcgaccgcc ttccggagtt tgacgtagta tgcccgcgag actggatcgg 43140 atttcaaagcaagtgctact acttttcgga gtcagagtcc aattggagtg aagccgaaaa 43200 attttgtagacagcaagagg cggagctagc agttcggcgt tccgaggagg aaaaggtaaa 43260 aagttaaattccaggaaact cctaattccc cgaaaaatta caaaaattaa cggagaccct 43320 ttacaggagttccttctgcg ccaatgcgga acaggaacta actggctggg cgtaaccagg 43380 aagtcgaaggacggagctga ttgggtggat gcatcgtacg atgattacgt accatggtga 43440 gtcatgttatacgtcacatc cgggatgtga cgtatgcgga agttgatccg ggagtgaaaa 43500 cccggaagtaacctgttaat ttgcatacag gtatgaaatt cggggaggcg gagactgcgt 43560 gtatttaaatggagaccgag tgacgtcagc ctactgtgat acccagaagc tatttgtctg 43620 ttcctgtcaagattcgtatt cgtattggtt agaaaacaaa taaatcaata aactaattta 43680 tgatatcattcatatttatg ggtgtggttt tattatgcgt cataaaacta ttttgcgtat 43740 agcgacacgctgcggttatg gccggttatg actgcgttag tttttgaggt tattatacat 43800 catc 43804<210> SEQ ID NO 2 211> LENGTH: 178 <212> TYPE: PRT <213> ORGANISM: CELOVirus <220> FEATURE: <223> OTHER INFORMATION: Position: 794..1330/note=ORF1 <400> SEQUENCE: 2 Met Asp Pro Phe Gly Ser Ser Ser Val Pro ProCys Ser Thr Ser Asp 1 5 10 15 Leu Pro Glu Pro Lys Leu Tyr Phe Val ArgLeu Ser Pro His Ala Val 20 25 30 Pro Pro Val Arg Ala Thr His Gly Ala AlaGly Tyr Asp Leu Phe Ser 35 40 45 Ala Tyr Asp Ile Lys Val Pro Ala Arg GlyArg Ala Leu Val Pro Thr 50 55 60 Asp Leu Val Phe Gln Phe Pro Pro Gly CysTyr Gly Arg Ile Ala Pro 65 70 75 80 Arg Ser Gly Leu Ala Ala Lys Phe PheIle Asp Val Gly Ala Gly Val 85 90 95 Ile Asp Pro Asp Tyr Arg Gly Asn ValSer Val Val Leu Phe Asn Phe 100 105 110 Ser Glu Ser Ser Phe Asn Ile ArgArg Gly Asp Arg Val Ala Gln Leu 115 120 125 Ile Leu Glu Arg Ile Met ValPro Glu Leu Ser Glu Leu Thr Gln Leu 130 135 140 Gly Glu Thr Asp Arg GlyAla Ser Gly Phe Gly Ser Thr Gly Met Gly 145 150 155 160 Ala Val Asp ArgAsn Gln Arg Ser Val Leu Glu Trp Leu Thr Pro Gly 165 170 175 Ser Arg<210> SEQ ID NO 3 <211> LENGTH: 276 <212> TYPE: PRT <213> ORGANISM: CELOVirus <220> FEATURE: <223> OTHER INFORMATION: Position: 1999..2829/note=ORF2 <400> SEQUENCE: 3 Met Ser Arg Glu Ser Glu Arg Tyr Trp Thr LeuVal His Ala Leu Ile 1 5 10 15 Asp Arg Gly Val Val Ser Arg Glu Gln TrpGln Met Val Asp Pro Ala 20 25 30 Gln Tyr Gln Phe Tyr His Arg Ser Lys GlnArg Gly Phe Lys Val Arg 35 40 45 His Ile Leu Arg Asp Val Ile Arg His MetCys Trp Ser Arg Thr Leu 50 55 60 Leu Asp Tyr Met Ser Ser Ala Ser Thr ProSer Pro Asp Asp Val Leu 65 70 75 80 Arg Asn Pro Leu Tyr Gln Leu Leu LeuCys Asn Gly Tyr Asn Pro Ala 85 90 95 Val Val Gly Thr Ala Leu Ile Arg TrpAla Gly His Gln Ser Asn Arg 100 105 110 Asn Thr Val Trp Ile Arg Gly ThrPro Met Ser Gly Ala Pro Tyr Leu 115 120 125 Ala Gln Ala Ile Ala Tyr CysSer Pro Leu Val Gly Ser Val Asp Trp 130 135 140 Arg Asn Lys Ser Asn ProPhe Glu Gly Cys Pro Asp Ser Leu Val Phe 145 150 155 160 Trp Trp Asp GlyGly Tyr Val Tyr Asp Cys Cys Val Gly Leu Val Lys 165 170 175 Gln Val PheArg Gly Glu His Val Ile Leu Pro Pro Glu Gly Leu Arg 180 185 190 Gly ProAsn Pro Cys Ser Glu Leu Phe Arg Thr Pro Val Leu Met Tyr 195 200 205 SerGln Ala Asp Ile Cys Met Thr Arg Leu Arg Ser Gly Glu Leu Ser 210 215 220Ala Glu His Ala Val Gly Leu Arg Asp Cys Met Tyr Leu Ile Arg Leu 225 230235 240 Thr Glu Asp Phe Asp Cys Ala Gly Gly Ile Ser Cys Ala Asp Val Lys245 250 255 Gln Phe Val Ala Trp Ser Arg Glu His Pro Gly Glu Val Arg GluThr 260 265 270 His Glu Leu Lys 275 <210> SEQ ID NO 4 <211> LENGTH: 104<212> TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHERINFORMATION: Position: 3781..4095 /note=ORF3 <400> SEQUENCE: 4 Met GlyVal Glu Gly Met Trp Asn Val Phe Leu Phe Ser Leu Gln Val 1 5 10 15 AlaAla Leu Pro Ser Ile Lys Cys Ser Ile Asn Gly Ser Gly Phe Ser 20 25 30 SerThr Lys Gly Arg Gln Tyr Arg Glu Ala Trp Gly Ala Ile Ser Pro 35 40 45 SerAsp Ser Met Glu Leu Ile Arg Leu Ser Glu Ile Ala Ser Gly Lys 50 55 60 HisAla His Lys Ala Leu Lys Arg Leu Leu Ala Leu Glu Ser Leu Pro 65 70 75 80Pro Gln Ser Thr Arg Val Phe Ser Ser Pro Arg Ser His Arg Arg Met 85 90 95Ala Leu Ala Ala Thr Phe Pro Ser 100 <210> SEQ ID NO 5 <211> LENGTH: 136<212> TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHERINFORMATION: Position: 5963..6373 /note=ORF4 <400> SEQUENCE: 5 Met ValAsp Val Glu Met Phe Gly Cys Gly Gly Leu Leu Val Ser His 1 5 10 15 LeuHis Lys Phe Gly Thr Glu Arg Ala Cys Leu Arg Gly Asp Gly Ala 20 25 30 ValPhe Pro Ala Val Glu Ile Gly Leu Asp Gln Leu Gln Val Pro Gly 35 40 45 ArgLeu Phe Asp Gly Trp Asn His Val Leu Phe Arg Ser Asp Glu Asp 50 55 60 AspArg Phe Gly Asp Arg Val Gln His Val Ala Arg Asp Glu Arg Pro 65 70 75 80Gln Gln Met Arg Leu Ala Gly Ser Gly Gly Ser Val Asp Asp Pro Asp 85 90 95Asp Gly Leu Leu Ala His Val Asp Gly Arg Gln Leu Ser Val Leu Glu 100 105110 Val Ala Thr Val His Leu Phe Leu Gly Phe Asn Phe Phe Val Gly Phe 115120 125 Glu Lys Leu Leu Ile Asn Ala Pro 130 135 <210> SEQ ID NO 6 <211>LENGTH: 378 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE:<223> OTHER INFORMATION: Position: 12193..13329 /gene: L1 /Product: L152K <400> SEQUENCE: 6 Met His Pro Val Leu Gln Ser Val Arg Asn Ala SerVal Ser Ala Gly 1 5 10 15 Gly Pro His Gln Gln Gln Pro Gln Gln Gln GlnHis Gly Val Ser Ser 20 25 30 Val Arg Arg Pro Pro Ser Pro Pro Arg Tyr ProAla Gln His Ala Tyr 35 40 45 Pro Gly Ala Gly Ala Thr Pro Thr Ala Gly ArgGly Asp Phe Asp Gly 50 55 60 Ala Leu Asp Pro Asp Glu Gly Pro Val Ala CysGly Leu Ala Ala Gly 65 70 75 80 Ala Gly Val Asp Glu Val Arg Met Arg GluArg Asp Ala Ala Arg Arg 85 90 95 Ala Thr Val Pro Glu Ile Asn Leu Phe LysAla Arg Arg Asp Val Val 100 105 110 Pro Asn Gly Asp Tyr Glu Arg Asp LeuMet Tyr His Ser Gly Gln Ala 115 120 125 Ile Asp Ile Asp Arg Gln Arg ValLeu Thr Pro Glu Asp Phe Lys Gly 130 135 140 Ser Glu Pro Ala Phe Thr ProAla Val Asn His Met Arg Ala Ala Glu 145 150 155 160 Leu Lys Arg Ala AlaGlu Gln Thr Ala Phe Gly Glu Glu Leu Arg Asn 165 170 175 Thr Cys His GlnThr Arg Ile Arg Thr Ala Leu Leu Arg Pro Glu Ile 180 185 190 Gly Ala GlyIle Tyr Tyr Leu Tyr Asp Phe Val Gln Thr Tyr Leu Glu 195 200 205 His ProAsp Gly Arg Val Lys Leu Asn Pro Gln Leu Val Leu Val Ala 210 215 220 GlnHis Ala Gly Asn Thr Met Leu Ala Gln Arg Leu Trp Ala Ile Ala 225 230 235240 Glu Glu Lys Asn Ala Trp Leu Arg Asp Leu Ile Glu Met Ala Tyr Met 245250 255 Ile Val Asn Asp Pro Tyr Leu Asn Thr Glu Gln Gln Leu Ser Ala Ile260 265 270 Cys Thr Thr Val Val Glu Leu Ser Met Lys Tyr Ala Lys Leu AlaAla 275 280 285 Lys Asn Gly Tyr Pro Ser Met Ala Gln Met Ala Lys Ala GlnGlu Phe 290 295 300 Phe Tyr Arg Val Met Gln Ala Val Leu Asp Leu Gly ValGln Val Gly 305 310 315 320 Val Tyr Asn Asn Arg Pro Ala Arg Tyr Arg GlnLys Arg Met Ser Glu 325 330 335 Ile Pro Gln Met Thr Asp Ala Glu Tyr MetPhe Gly Leu Thr Gln Ala 340 345 350 Leu Glu Ser Arg Pro Pro Gln Gly GluSer Phe Ala Asp Glu Gly Pro 355 360 365 Ser Glu Ser Asp Asp Glu Asp AspPhe Ile 370 375 <210> SEQ ID NO 7 <211> LENGTH: 575 <212> TYPE: PRT<213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION:Position: 13316..15043 /gene: L1 /product: L1 IIIa <400> SEQUENCE: 7 MetThr Ser Ser Asp Thr Phe Leu Ala Leu Ala Pro Tyr Gly Arg Gln 1 5 10 15Glu Val Ala Asp Ala Leu Ser Ser Leu Pro Asp Gly Lys Asp Ala Arg 20 25 30Ser Leu Arg His Ala Pro Tyr Ala Asn Arg Leu Ile Lys Leu Gln Ser 35 40 45Ala Met Val Pro Pro Lys Val Asp Gly Thr Ser Glu Arg Val Ala Glu 50 55 60Ile Val Lys Gly Leu Ala Glu Gln Gly Ala Ile Tyr Pro Asp Gln Met 65 70 7580 Gly Ala Ile His Ser Asp Leu Leu Asn Arg Ala Tyr Thr Trp Asn Ser 85 9095 Met Gly Val Gln Glu Ser Ile Gln Ala Leu Val Asn Asp Val Ile His 100105 110 Gly Gln Asn Arg Thr Leu Gln Asp Glu Leu Ala Arg Thr Lys Glu Ile115 120 125 Ala Asn Ala Ser Leu Leu Thr Gln Phe Phe Asp Ser Leu Tyr LysThr 130 135 140 Val Asp Arg Gly Gln Arg Asn Phe Glu Gly Phe Lys Lys LeuLeu Arg 145 150 155 160 Leu Phe Val Asn Asn Val Pro Asn Ala Glu Val TyrGly Ser Ser Gly 165 170 175 Ser Phe Ser Val Gln Ile Asn Leu Gly Gly SerSer Gln Asn Ile Asn 180 185 190 Leu Thr Asn Ala Phe Glu Asn Leu Lys ProIle Trp Gly Ala Arg Trp 195 200 205 Asp Ala Val Asn Asn Pro Arg Ile GlyAla Leu Leu Thr Pro Asn Thr 210 215 220 Arg Ala Leu Leu Phe Phe Val SerSer Phe Tyr Asp Tyr Gly Ala Met 225 230 235 240 Glu Pro Gly Ser Tyr LeuAsp Asn Ile Met Arg Leu Tyr Lys Glu Ala 245 250 255 Ile Arg Ala Asp ValAsp Ala Glu Gly Asp Ala Ile Met Glu Leu Gly 260 265 270 Glu Ala Gly AlaAsn Leu Asn Leu Arg Phe Asn Asp Tyr Lys Asp Thr 275 280 285 Leu Asn TyrLeu Leu Gln Asn Arg Glu Val Val Pro Asp Thr Ala Pro 290 295 300 Leu GluLeu Ser Ala Glu Gln Glu Met Leu Leu Lys Tyr Leu Met Arg 305 310 315 320Gln Leu Arg Gln Ala Leu Lys Asp Gly Val Pro Ala Asp Ile Ser Ile 325 330335 Ser Thr Met Thr Gln Tyr Leu Asp Pro Arg Leu Tyr Gln Thr Asn Lys 340345 350 Val Phe Val Glu Lys Leu Gln Asn Tyr Leu Leu Ala Ala Gln Ala Arg355 360 365 Asn Pro Val Tyr Tyr Arg Leu Leu Val Leu Asp Pro Asn Trp ArgPro 370 375 380 Pro Ala Gly Leu Tyr Thr Gly Asn Tyr Val Ile Pro Asp ArgTyr Asp 385 390 395 400 Phe Glu Asp Val Gln Ser Glu Leu Glu Tyr Ala GlyPro Ser Arg Asp 405 410 415 Glu Tyr Phe Asp Asp Ser Leu Phe Ala Pro GlyPro Gln Arg Arg Leu 420 425 430 Asn Ser Ala Glu Glu Ala Gln Leu Glu ArgAsp Ile Glu Ser Leu Thr 435 440 445 Gly His Ile Asp Glu Glu Leu Gly ValGln Ser Gln Ala Gly Trp Leu 450 455 460 Ala Asp His Arg Leu Pro Val AlaPhe Asp Gly Ala Leu Ser Leu Thr 465 470 475 480 Glu Arg Asn Ala Tyr AsnThr Pro Leu Pro Pro Asp Ser His Met Arg 485 490 495 Ser Arg Ser Ser SerVal Ala Ser Asp Leu Gly Leu Leu Asn Leu Ser 500 505 510 Gly Thr Gly GlyPro Gly Phe Phe Ala Ser Leu Arg Pro Ser Ile Gly 515 520 525 Ser Arg GlnPro Thr Gly Thr Ala Val Gly Leu Arg Pro Thr Thr Pro 530 535 540 Tyr SerGly Ser Gly Cys Met Arg Gly Thr Gly Leu Ala Arg Lys Val 545 550 555 560Leu Asn Pro Ala Ala Ser Arg Arg Gly Arg Lys Leu Arg Phe Tyr 565 570 575<210> SEQ ID NO 8 <211> LENGTH: 515 <212> TYPE: PRT <213> ORGANISM: CELOVirus <220> FEATURE: <223> OTHER INFORMATION: Position: 15110..16657/gene: L2 /product: penton base <400> SEQUENCE: 8 Met Tyr Arg Ser LeuArg Pro Pro Thr Ser Ile Pro Pro Pro Pro Pro 1 5 10 15 Ser Gly Pro SerPro Tyr Pro Ala Met Ile Asn Gly Tyr Pro Pro Asp 20 25 30 Val Pro Val GlySer Pro Ala Asn Gly Asp Ala Glu Leu Phe Val Pro 35 40 45 Leu Gln Arg ValMet Pro Pro Thr Gly Gly Arg Asn Ser Ile Arg Tyr 50 55 60 Arg Asn Tyr AlaPro Cys Gln Asn Thr Thr Lys Phe Phe Tyr Val Asp 65 70 75 80 Asn Lys LeuSer Asp Leu Asp Thr Tyr Asn Glu Asp Ala Asn His Ser 85 90 95 Asn Phe ArgThr Thr Val Ile His Asn Gln Asp Leu Asp Pro Ser Thr 100 105 110 Ala AlaThr Glu Thr Ile Gln Leu Asp Asn Arg Ser Cys Trp Gly Gly 115 120 125 GluLeu Lys Thr Ala Val Lys Thr Asn Cys Pro Asn Ile Ser Ser Phe 130 135 140Phe Gln Ser Asp Thr Val Arg Val Arg Leu Met Ser Lys Arg Asp Pro 145 150155 160 Gly Gly Thr Asp Pro Asp Ala Gly Val Asn Asn Pro Pro Gly Ala Glu165 170 175 Tyr Lys Trp Tyr Asp Leu Arg Ile Pro Glu Gly Asn Tyr Ala LeuAsn 180 185 190 Glu Ile Ile Asp Leu Leu Asn Glu Gly Ile Val Gln Leu TyrLeu Gln 195 200 205 Glu Gly Arg Gln Asn Asn Val Leu Lys Ser Asp Ile GlyVal Lys Phe 210 215 220 Asp Thr Arg Tyr Leu Asp Leu Leu Lys Asp Pro ValThr Gly Leu Val 225 230 235 240 Thr Pro Gly Thr Tyr Val Tyr Lys Gly TyrHis Pro Asp Ile Ile Leu 245 250 255 Leu Pro Gly Cys Ala Val Asp Phe ThrPhe Ser Arg Leu Ser Leu Leu 260 265 270 Leu Gly Ile Ala Lys Arg Glu ProTyr Ser Lys Gly Phe Thr Ile Thr 275 280 285 Tyr Glu Asp Leu Gln Gly GlyAsn Val Pro Ala Leu Leu Asp Leu Ser 290 295 300 Ser Val Gln Val Asp AspGln Asp Glu Asp Val Ile Val Val Ala Asp 305 310 315 320 Ala Arg Pro LeuLeu Lys Asp Ser Lys Gly Val Ser Tyr Asn Val Ile 325 330 335 Thr Thr GlyVal Thr Gln Pro Gln Thr Ala Tyr Arg Ser Trp Leu Leu 340 345 350 Ala TyrHis Thr Leu Asp Ser Pro Ala Arg Asn Lys Thr Leu Leu Thr 355 360 365 ValPro Asp Met Ala Gly Gly Ile Gly Ala Met Tyr Thr Ser Met Pro 370 375 380Asp Thr Phe Thr Ala Pro Ala Gly Phe Lys Glu Asp Asn Thr Thr Asn 385 390395 400 Leu Cys Pro Val Val Ala Met Asn Leu Phe Pro Ser Phe Asn Lys Val405 410 415 Phe Tyr Gln Gly Ala Ser Ala Tyr Val Gln Arg Leu Glu Asn AlaThr 420 425 430 Gln Ser Ala Thr Ala Ala Phe Asn Arg Phe Pro Glu Asn GluIle Leu 435 440 445 Lys Gln Ala Pro Pro Met Asn Val Ser Ser Val Cys AspAsn Gln Pro 450 455 460 Ala Val Val Gln Gln Gly Val Leu Pro Leu Lys AsnSer Leu Ser Gly 465 470 475 480 Leu Gln Arg Val Leu Ile Thr Asp Asp ArgArg Arg Pro Ile Pro Tyr 485 490 495 Val Tyr Lys Thr Ile Ala Thr Val GlnPro Arg Val Leu Ser Ser Ser 500 505 510 Thr Leu Gln 515 <210> SEQ ID NO9 <211> LENGTH: 72 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220>FEATURE: <223> OTHER INFORMATION: Position: 16679..16897 /gene: L2/product: L2pVII <400> SEQUENCE: 9 Met Ser Ile Leu Ile Ser Pro Ser AspAsn Arg Gly Trp Gly Ala Asn 1 5 10 15 Met Arg Tyr Arg Arg Arg Ala SerMet Arg Gly Val Gly Arg Arg Arg 20 25 30 Leu Thr Leu Arg Gln Leu Leu GlyLeu Gly Ser Arg Arg Arg Arg Arg 35 40 45 Ser Arg Pro Thr Thr Val Ser AsnArg Leu Val Val Val Ser Thr Arg 50 55 60 Arg Arg Ser Ser Arg Arg Arg Arg65 70 <210> SEQ ID NO 10 <211> LENGTH: 188 <212> TYPE: PRT <213>ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION: Position:16929..17495 /gene: L2 /product: L2 mu (pX, 11K) <400> SEQUENCE: 10 MetCys Ala Val Ala Ile His Arg Ser Asp Val Val Met Pro Ser Val 1 5 10 15Leu Leu Thr Gly Gly Arg Thr Ala Lys Gly Lys Lys Arg Ala Ser Arg 20 25 30Arg Arg Val Lys Val Pro Lys Leu Pro Lys Gly Ala Arg Arg Lys Arg 35 40 45Ala Ser Val Thr Pro Val Pro Thr Val Ala Thr Ala Thr Ala Ser Glu 50 55 60Arg Ala Ala Leu Thr Asn Leu Ala Arg Arg Leu Gln Arg Gly Asp Tyr 65 70 7580 Ala Ala Trp Arg Pro Ala Asp Tyr Thr Ser Pro Ala Val Ser Glu Ala 85 9095 Ala Arg Ala Ala Ala Ser Ser Gly Thr Pro Ala Thr Ala Arg Asp Leu 100105 110 Ala Thr Gly Thr Leu Ala Arg Ala Val Pro Met Thr Gly Thr Gly Gly115 120 125 Arg Arg Arg Lys Arg Thr Ala Thr Arg Arg Arg Ser Leu Lys GlyGly 130 135 140 Phe Leu Pro Ala Leu Ile Pro Ile Ile Ala Ala Ala Ile GlyAla Ile 145 150 155 160 Pro Gly Ile Ala Gly Thr Ala Val Gly Ile Ala AsnLeu Lys Glu Gln 165 170 175 Gln Arg Gln Phe Asn Lys Ile Tyr Gly Asp LysLys 180 185 <210> SEQ ID NO 11 <211> LENGTH: 223 <212> TYPE: PRT <213>ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION: Position:17559..18230 /gene: L3 /product: L3 pVI <400> SEQUENCE: 11 Met Asp TyrAla Ala Leu Ser Pro His Leu Gly Gly Trp Ala Leu Arg 1 5 10 15 Asp HisHis Ile Gly Asp Ser Ser Leu Arg Gly Gly Ala Ile Asn Trp 20 25 30 Gly AsnLeu Gly Ser Arg Ile Thr Ser Ala Leu Asn Ser Thr Gly Arg 35 40 45 Trp LeuTyr Asn Thr Gly Asn Arg Phe Val His Ser Asn Thr Phe Asn 50 55 60 Gln IleLys Gln Gly Ile Gln Asp Ser Gly Val Ile Arg Asn Val Ala 65 70 75 80 AsnLeu Ala Gly Glu Thr Leu Gly Ala Leu Thr Asp Ile Gly Arg Leu 85 90 95 LysLeu Gln Gln Asp Leu Glu Lys Leu Arg Arg Lys Ala Leu Gly Glu 100 105 110Glu Gly Pro Ala Thr Gln Ala Glu Leu Gln Ala Leu Ile Gln Ala Leu 115 120125 Gln Ala Gln Val Ala Ala Gly Glu Pro Pro Ala Ala Pro Ala Ala Pro 130135 140 Ala Pro Ala Pro Pro Leu Val Pro Thr Thr Arg Pro Ile Pro Glu Met145 150 155 160 Val Thr Glu Val Lys Pro Pro Val Thr Ser Ser Ala Pro AlaVal Pro 165 170 175 Val Asp Val Pro Thr Thr Leu Glu Met Arg Pro Pro ProPro Lys Arg 180 185 190 Arg Arg Lys Arg Ala Arg Pro Gly Gln Trp Arg AlaArg Leu Asp Ser 195 200 205 Leu Ser Gly Thr Gly Val Ala Thr Ala Thr ArgArg Met Cys Tyr 210 215 220 <210> SEQ ID NO 12 <211> LENGTH: 942 <212>TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHERINFORMATION: Position: 18289..21117 /gene: L3 /product: L3 hexon <400>SEQUENCE: 12 Met Thr Ala Leu Thr Pro Asp Leu Thr Thr Ala Thr Pro Arg LeuGln 1 5 10 15 Tyr Phe His Ile Ala Gly Pro Gly Thr Arg Glu Tyr Leu SerGlu Asp 20 25 30 Leu Gln Gln Phe Ile Ser Ala Thr Gly Ser Tyr Phe Asp LeuLys Asn 35 40 45 Lys Phe Arg Gln Thr Val Val Ala Pro Thr Arg Asn Val ThrThr Glu 50 55 60 Lys Ala Gln Arg Leu Gln Ile Arg Phe Tyr Pro Ile Gln ThrAsp Asp 65 70 75 80 Thr Pro Asn Ser Tyr Arg Val Arg Tyr Ser Val Asn ValGly Asp Ser 85 90 95 Trp Val Leu Asp Met Gly Ala Thr Tyr Phe Asp Ile LysGly Val Leu 100 105 110 Asp Arg Gly Pro Ser Phe Lys Pro Tyr Gly Gly ThrAla Tyr Asn Pro 115 120 125 Leu Ala Pro Arg Glu Ala Ile Phe Asn Thr TrpVal Glu Ser Thr Gly 130 135 140 Pro Gln Thr Asn Val Val Gly Gln Met ThrAsn Val Tyr Thr Asn Gln 145 150 155 160 Thr Arg Asn Asp Lys Thr Ala ThrLeu Gln Gln Val Asn Ser Ile Ser 165 170 175 Gly Val Val Pro Asn Val AsnLeu Gly Pro Gly Leu Ser Gln Leu Ala 180 185 190 Ser Arg Ala Asp Val AspAsn Ile Gly Val Val Gly Arg Phe Ala Lys 195 200 205 Val Asp Ser Ala GlyVal Lys Gln Ala Tyr Gly Ala Tyr Val Lys Pro 210 215 220 Val Lys Asp AspGly Ser Gln Ser Leu Asn Gln Thr Ala Tyr Trp Leu 225 230 235 240 Met AspAsn Gly Gly Thr Asn Tyr Leu Gly Ala Leu Ala Val Glu Asp 245 250 255 TyrThr Gln Thr Leu Ser Tyr Pro Asp Thr Val Leu Val Thr Pro Pro 260 265 270Thr Ala Tyr Gln Gln Val Asn Ser Gly Thr Met Arg Ala Cys Arg Pro 275 280285 Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Asn Leu Leu Tyr His Asp 290295 300 Ser Gly Val Cys Ser Gly Thr Leu Asn Ser Glu Arg Ser Gly Met Asn305 310 315 320 Val Val Val Glu Leu Gln Asp Arg Asn Thr Glu Leu Ser TyrGln Tyr 325 330 335 Met Leu Ala Asp Met Met Ser Arg His His Tyr Phe AlaLeu Trp Asn 340 345 350 Gln Ala Val Asp Gln Tyr Asp His Asp Val Arg ValPhe Asn Asn Asp 355 360 365 Gly Tyr Glu Glu Gly Val Pro Thr Tyr Ala PheLeu Pro Asp Gly His 370 375 380 Gly Ala Gly Glu Asp Asn Gly Pro Asp LeuSer Asn Val Lys Ile Tyr 385 390 395 400 Thr Asn Gly Gln Gln Asp Lys GlyAsn Val Val Ala Gly Thr Val Ser 405 410 415 Thr Gln Leu Asn Phe Gly ThrIle Pro Ser Tyr Glu Ile Asp Ile Ala 420 425 430 Ala Ala Thr Arg Arg AsnPhe Ile Met Ser Asn Ile Ala Asp Tyr Leu 435 440 445 Pro Asp Lys Tyr LysPhe Ser Ile Arg Gly Phe Asp Pro Val Thr Asp 450 455 460 Asn Ile Asp ProThr Thr Tyr Phe Tyr Met Asn Arg Arg Val Pro Leu 465 470 475 480 Thr AsnVal Val Asp Leu Phe Thr Asn Ile Gly Ala Arg Trp Ser Val 485 490 495 AspGln Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Trp Gly 500 505 510Leu Lys Tyr Arg Ser Gln Leu Leu Gly Asn Ser Arg Tyr Cys Arg Phe 515 520525 His Ile Gln Val Pro Gln Lys Tyr Phe Ala Ile Lys Asn Leu Leu Leu 530535 540 Leu Pro Gly Thr Tyr Thr Tyr Glu Trp Val Leu Arg Lys Asp Pro Asn545 550 555 560 Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Ala AspGly Ala 565 570 575 Gln Ile Val Tyr Thr Glu Val Asn Leu Met Ala Asn PheMet Pro Met 580 585 590 Asp His Asn Thr Ser Asn Gln Leu Glu Leu Met LeuArg Asn Ala Thr 595 600 605 Asn Asp Gln Thr Phe Ala Asp Tyr Leu Gly AlaLys Asn Ala Leu Tyr 610 615 620 Asn Val Pro Ala Gly Ser Thr Leu Leu ThrIle Asn Ile Pro Ala Arg 625 630 635 640 Thr Trp Glu Gly Met Arg Gly TrpSer Phe Thr Arg Leu Lys Ala Ser 645 650 655 Glu Thr Pro Gln Leu Gly AlaGln Tyr Asp Val Gly Phe Lys Tyr Ser 660 665 670 Gly Ser Ile Pro Tyr SerAsp Gly Thr Phe Tyr Leu Ser His Thr Phe 675 680 685 Arg Ser Met Ser ValLeu Phe Asp Thr Ser Ile Asn Trp Pro Gly Asn 690 695 700 Asp Arg Leu LeuThr Pro Asn Leu Phe Glu Ile Lys Arg Pro Val Ala 705 710 715 720 Thr AspSer Glu Gly Phe Thr Met Ser Gln Cys Asp Met Thr Lys Asp 725 730 735 TrpPhe Leu Val Gln Met Ala Thr Asn Tyr Asn Tyr Val Tyr Asn Gly 740 745 750Tyr Arg Phe Trp Pro Asp Arg His Tyr Phe His Tyr Asp Phe Leu Arg 755 760765 Asn Phe Asp Pro Met Ser Arg Gln Gly Pro Asn Phe Leu Asp Thr Thr 770775 780 Leu Tyr Asp Leu Val Ser Ser Thr Pro Val Val Asn Asp Thr Gly Ser785 790 795 800 Gln Pro Ser Gln Asp Asn Val Arg Asn Asn Ser Gly Phe IleAla Pro 805 810 815 Arg Ser Trp Pro Val Trp Thr Ala Gln Gln Gly Glu AlaTrp Pro Ala 820 825 830 Asn Trp Pro Tyr Pro Leu Ile Gly Asn Asp Ala IleSer Ser Asn Gln 835 840 845 Thr Val Asn Tyr Lys Lys Phe Leu Cys Asp AsnTyr Leu Trp Thr Val 850 855 860 Pro Phe Ser Ser Asp Phe Met Tyr Met GlyGlu Leu Thr Asp Leu Gly 865 870 875 880 Gln Asn Pro Met Tyr Thr Asn AsnSer His Ser Met Val Ile Asn Phe 885 890 895 Glu Leu Asp Pro Met Asp GluAsn Thr Tyr Val Tyr Met Leu Tyr Gly 900 905 910 Val Phe Asp Thr Val ArgVal Asn Gln Pro Glu Arg Asn Val Leu Ala 915 920 925 Met Ala Tyr Phe ArgThr Pro Phe Ala Thr Gly Asn Ala Val 930 935 940 <210> SEQ ID NO 13 <211>LENGTH: 206 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE:<223> OTHER INFORMATION: Position: 21134..21754 /gene: L3 /product: L3protease <400> SEQUENCE: 13 Met Ser Gly Thr Thr Glu Thr Gln Leu Arg AspLeu Leu Ser Ser Met 1 5 10 15 His Leu Arg His Arg Phe Leu Gly Val PheAsp Lys Ser Phe Pro Gly 20 25 30 Phe Leu Asp Pro His Val Pro Ala Ser AlaIle Val Asn Thr Gly Ser 35 40 45 Arg Ala Ser Gly Gly Met His Trp Ile GlyPhe Ala Phe Asp Pro Ala 50 55 60 Ala Gly Arg Cys Tyr Met Phe Asp Pro PheGly Trp Ser Asp Gln Lys 65 70 75 80 Leu Trp Glu Leu Tyr Arg Val Lys TyrAsn Ala Phe Met Arg Arg Thr 85 90 95 Gly Leu Arg Gln Pro Asp Arg Cys PheThr Leu Val Arg Ser Thr Glu 100 105 110 Ala Val Gln Cys Pro Cys Ser AlaAla Cys Gly Leu Phe Ser Ala Leu 115 120 125 Phe Ile Val Ser Phe Asp ArgTyr Arg Ser Lys Pro Met Asp Gly Asn 130 135 140 Pro Val Ile Asp Thr ValVal Gly Val Lys His Glu Asn Met Asn Ser 145 150 155 160 Pro Pro Tyr ArgAsp Ile Leu His Arg Asn Gln Glu Arg Thr Tyr Tyr 165 170 175 Trp Trp ThrLys Asn Ser Ala Tyr Phe Arg Ala His Gln Glu Glu Leu 180 185 190 Arg ArgGlu Thr Ala Leu Asn Ala Leu Pro Glu Asn His Val 195 200 205 <210> SEQ IDNO 14 <211> LENGTH: 984 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220>FEATURE: <223> OTHER INFORMATION: Position: 23680..26634 /gene: L4/product: L4 100K <400> SEQUENCE: 14 Met Ala Asp Lys Ile Thr Arg Glu GluLys Thr Ile Ala Thr Leu Asp 1 5 10 15 Leu Val Leu Arg Val Val Val AspAla Gly Asn Trp Asp Val Phe Ser 20 25 30 Lys Arg Leu Val Arg Tyr Thr ArgGlu Gln Tyr Gly Ile Glu Leu Pro 35 40 45 Glu Asp Ile Gly Asp Leu Pro AspThr Ser Glu Val Ser Lys Val Leu 50 55 60 Leu Ser His Leu Gly Glu Asp LysAla Val Leu Ser Ala Tyr Arg Ile 65 70 75 80 Ala Glu Leu Thr Gln Pro SerGlu Met Asp Arg Ala Lys Val Thr Glu 85 90 95 Gly Gly Leu Ala Val Leu AsnAla Ser Arg Asp Glu Ser Glu Ala Gln 100 105 110 Asn Pro Ser Asn Pro GluPro Glu Ser Ile Glu Ser Asp Ala Val Glu 115 120 125 Asp Leu Gly Val AlaAla Glu Ser Asp Pro Ser Asp Asp Glu Pro Asp 130 135 140 Pro Glu Pro GluTyr Asp His Arg Glu Ala Asp His Asp Ser Asp Ala 145 150 155 160 Asp SerGly Tyr Tyr Ser Ala Asp Gly Gly Arg Pro Gly Thr Pro Val 165 170 175 AspGlu Glu Pro Gln Asp Asp Ser Pro Ser Ser Glu Glu Thr Ala Ser 180 185 190Thr Val Ile Glu Glu Ala Gln Thr Ser Ala Ser Asn Asp Ser His Asp 195 200205 Asp Asp Thr His Arg Asp Asp Gly Ser Ala Ser Glu Glu Asp Leu Glu 210215 220 Arg Asp Ala Leu Val Ala Pro Ala Asp Pro Phe Pro Asn Leu Arg Lys225 230 235 240 Cys Phe Glu Arg Gln Ala Met Met Leu Thr Gly Ala Leu LysAsp Ala 245 250 255 Ala Asp Thr Ala Asp Pro Pro Glu Thr Leu Ser Val AspSer Val Gln 260 265 270 Arg Gln Leu Glu Arg Phe Val Phe Asn Pro Asp ArgArg Val Pro Ala 275 280 285 Glu His Leu Glu Val Arg Tyr Asn Phe Tyr ProPro Phe Leu Thr Pro 290 295 300 Lys Ala Ile Ala Ser Tyr His Ile Phe AlaVal Thr Ala Ser Ile Pro 305 310 315 320 Leu Ser Cys Lys Ala Asn Arg SerGly Ser Asp Leu Leu Ala Lys Ala 325 330 335 Lys Glu Ser Thr Phe Phe LysArg Leu Pro Lys Trp Arg Leu Gly Ile 340 345 350 Glu Ile Asp Asp Gly LeuGly Thr Glu Val Thr Ala Val Thr Glu Leu 355 360 365 Glu Glu Ala Lys MetVal Pro Leu Lys Asp Asp Val Ser Arg Leu Gln 370 375 380 Trp Ala Lys MetArg Gly Glu His Ile Arg Phe Phe Ser Tyr Pro Ser 385 390 395 400 Leu HisMet Pro Pro Lys Ile Ser Arg Met Leu Met Glu Thr Leu Leu 405 410 415 GlnPro Phe Ala Asp Glu Asn Gln Lys Ala Glu Glu Ala Leu Pro Cys 420 425 430Leu Ser Asp Glu Glu Val Leu Ala Ile Val Asp Pro Thr Gly Arg Leu 435 440445 His Gly Glu Asp Ala Leu Lys Ala Val Glu Lys Arg Arg Ala Ala Val 450455 460 Thr Met Ala Val Arg Tyr Thr Ala Thr Leu Glu Leu Met Glu Arg Val465 470 475 480 Phe Arg Glu Pro Ser Met Val Lys Lys Met Gln Glu Val LeuHis His 485 490 495 Thr Phe His His Gly Phe Val Ala Leu Val Arg Glu ThrAla Lys Val 500 505 510 Asn Leu Ser Asn Tyr Ala Thr Phe His Gly Leu ThrTyr Asn Asn Pro 515 520 525 Leu Asn Asn Cys Ile Met Ser Lys Leu Leu GluGly Ala Asp Lys Glu 530 535 540 Asp Tyr Val Val Asp Ser Ile Tyr Leu PheLeu Val Leu Thr Trp Gln 545 550 555 560 Thr Ala Met Gly Met Trp Gln GlnAla Ile Asp Asp Met Thr Ile Gln 565 570 575 Met Tyr Thr Glu Val Phe ThrLys Asn Lys Tyr Arg Leu Tyr Ser Leu 580 585 590 Pro Asn Pro Thr Ala IleGly Lys Ala Ile Val Asp Ile Leu Met Asp 595 600 605 Tyr Asp Arg Leu ThrGlu Glu Met Arg Lys Ala Leu Pro Asn Phe Thr 610 615 620 Cys Gln Ser GlnIle Thr Ala Phe Arg His Phe Leu Leu Glu Arg Ser 625 630 635 640 Asn IlePro Ala Val Ala Ala Pro Phe Met Pro Ser Asp Phe Val Pro 645 650 655 LeuAla Tyr Lys Gln Ser Pro Pro Leu Leu Trp Asp Gln Val Tyr Leu 660 665 670Leu Gln Leu Ala Phe Tyr Leu Thr Lys His Gly Gly Tyr Leu Trp Glu 675 680685 Ala Pro Glu Glu Glu Ala Asn Asn Pro Ser Asn Arg Thr Tyr Cys Pro 690695 700 Cys Asn Leu Cys Ser Pro His Arg Met Pro Gly His Asn Ala Ala Leu705 710 715 720 His Asn Glu Ile Leu Ala Ile Gly Thr Phe Glu Ile Arg SerPro Asp 725 730 735 Gly Lys Thr Phe Lys Leu Thr Pro Glu Leu Trp Thr AsnAla Tyr Leu 740 745 750 Asp Lys Phe Asp Ala Glu Asp Phe His Pro Phe ThrVal Phe His Tyr 755 760 765 Pro Glu Asn Ala Ser Arg Phe Ala Ser Thr LeuLys Ala Cys Val Thr 770 775 780 Gln Ser Pro Glu Ile Leu Ser Leu Ile ArgGln Ile Gln Glu Ser Arg 785 790 795 800 Glu Glu Phe Leu Leu Thr Lys GlyLys Gly Val Tyr Lys Asp Pro Asn 805 810 815 Thr Gly Glu Thr Ile Ser ArgGln Pro Arg Asp Thr Ala Arg Ala Gln 820 825 830 His Ala Gly Asp Gly GlnAla Leu Pro Ala Pro Gly Ala Tyr Thr Thr 835 840 845 Gly Gly Asn Arg AlaGlu Thr Ala Pro Ala Gly Ala Val Arg Leu Ala 850 855 860 Pro Asp Tyr GlnAsp Gly Gln Phe Pro Ile Ala Lys Val Gly Pro His 865 870 875 880 Tyr HisGly Pro Lys Asn Val Arg Arg Glu Asp Gln Gly Tyr Arg Gly 885 890 895 GlyPro Gly Gly Val Arg Gly Glu Arg Glu Val Val Leu Ser Arg Arg 900 905 910Ala Gly Gly Arg Arg Phe Gly Arg Arg Asn Thr Arg Gln Ser Gly Tyr 915 920925 Asn Glu Arg Ala Asn Arg Tyr Phe Gly Arg Gly Gly Gly Gly Ser Val 930935 940 Arg Gly Gln Gln Gly Glu His Pro Thr Thr Ser Pro Ser Ala Ser Glu945 950 955 960 Pro Pro Ala Pro Ser Arg Ile Leu Ala Arg Gly Thr Pro ProSer Pro 965 970 975 Glu Arg Arg Asp Arg Gln Glu Glu 980 <210> SEQ ID NO15 <211> LENGTH: 245 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220>FEATURE: <223> OTHER INFORMATION: Position: 27149..27886 /gene: L4/product: L4 pVIII <400> SEQUENCE: 15 Met Asn Leu Met Asn Ala Thr ProThr Glu Tyr Val Trp Lys Tyr Asn 1 5 10 15 Pro Val Ser Gly Ile Pro AlaGly Ala Gln Gln Asn Tyr Gly Ala Thr 20 25 30 Ile Asp Trp Val Leu Pro GlyGly Thr Gly Phe Ala Ile Ala Thr Asn 35 40 45 Asp Ile Arg Arg Gln Thr LeuAsn Pro Ala Val Thr Arg Ala Ile Thr 50 55 60 Ala Arg Phe Glu Ala Glu SerAsp Gln Gln Pro Tyr Ala Ser Pro His 65 70 75 80 Glu Thr Asn Val Ile AlaAla Asn Val Leu Asp Ser Gly Tyr Pro Lys 85 90 95 Ser Gly Leu Tyr Pro LeuGlu Leu Ser Gly Asn Gln Arg Val Gln Leu 100 105 110 Ala Gly Gly Leu MetVal Gly Arg Thr Glu Gly Arg Met Gln Leu Ala 115 120 125 Gly Gly Leu ThrGlu Gly Arg Val Gln Leu Ser Gly Gly Phe His Gly 130 135 140 Arg Pro LeuVal Arg Gly Arg Ser Arg Arg Pro Pro Arg Trp Cys Gly 145 150 155 160 AlaGlu Leu Thr Gly Asn Gly Leu Pro Glu Gln Ala Glu Val Thr Ser 165 170 175Asp Thr Tyr Lys Tyr Phe Leu Arg Thr Gln Gly Pro Ser Gln Val Val 180 185190 Glu Glu Pro Gly Val Phe Ser Gln Arg Gln Phe Met Thr Thr Phe Leu 195200 205 Pro Ser Val Val Pro His Pro Phe Asp Ser Thr Asn Pro Gly Asp Phe210 215 220 Pro Ala Gln Tyr Ser Ala Ile Tyr Lys Gly Arg Thr Ala Phe GluAsp 225 230 235 240 Thr Phe Trp Asp Trp 245 <210> SEQ ID NO 16 <211>LENGTH: 710 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE:<223> OTHER INFORMATION: Position: 28363..30495 /gene: L5 /product: L5fibre 1 <400> SEQUENCE: 16 Met Thr Ser Pro Leu Thr Leu Ser Gln Arg AlaLeu Ala Leu Lys Thr 1 5 10 15 Asp Ser Thr Leu Thr Leu Asn Thr Gln GlyGln Leu Gly Val Ser Leu 20 25 30 Thr Pro Gly Asp Gly Leu Val Leu Asn ThrAsn Gly Leu Ser Ile Asn 35 40 45 Ala Asp Pro Gln Thr Leu Ala Phe Asn AsnSer Gly Ala Leu Glu Val 50 55 60 Asn Leu Asp Pro Asp Gly Pro Trp Ser LysThr Ala Thr Gly Ile Asp 65 70 75 80 Leu Arg Leu Asp Pro Thr Thr Leu GluVal Asp Asn Trp Glu Leu Gly 85 90 95 Val Lys Leu Asp Pro Asp Glu Ala IleAsp Ser Gly Pro Asp Gly Leu 100 105 110 Cys Leu Asn Leu Asp Glu Thr LeuLeu Leu Ala Thr Asn Ser Thr Ser 115 120 125 Gly Lys Thr Glu Leu Gly ValHis Leu Asn Thr Ser Gly Pro Ile Thr 130 135 140 Ala Asp Asp Gln Gly IleAsp Leu Asp Val Asp Pro Asn Thr Met Gln 145 150 155 160 Val Asn Thr GlyPro Ser Gly Gly Met Leu Ala Val Lys Leu Lys Ser 165 170 175 Gly Gly GlyLeu Thr Ala Asp Pro Asp Gly Ile Ser Val Thr Ala Thr 180 185 190 Val AlaPro Pro Ser Ile Ser Ala Thr Ala Pro Leu Thr Tyr Thr Ser 195 200 205 GlyThr Ile Ala Leu Thr Thr Asp Thr Gln Thr Met Gln Val Asn Ser 210 215 220Asn Gln Leu Ala Val Lys Leu Lys Thr Gly Gly Gly Leu Thr Ala Asp 225 230235 240 Ala Asp Gly Ile Ser Val Ser Val Ala Pro Thr Pro Thr Ile Ser Ala245 250 255 Ser Pro Pro Leu Thr Tyr Thr Asn Gly Gln Ile Gly Leu Ser IleGly 260 265 270 Asp Gln Ser Leu Gln Val Ser Ser Gly Gln Leu Gln Val LysLeu Lys 275 280 285 Ser Gln Gly Gly Ile Gln Gln Ser Thr Gln Gly Leu GlyVal Ala Val 290 295 300 Asp Gln Thr Leu Lys Ile Val Ser Asn Thr Leu GluVal Asn Thr Asp 305 310 315 320 Pro Ser Gly Pro Leu Thr Ser Gly Asn AsnGly Leu Ser Leu Ala Ala 325 330 335 Val Thr Pro Leu Ala Val Ser Ser AlaGly Val Thr Leu Asn Tyr Gln 340 345 350 Ser Pro Leu Thr Val Thr Ser AsnSer Leu Gly Leu Ser Ile Ala Ala 355 360 365 Pro Leu Gln Ala Gly Ala GlnGly Leu Thr Val Asn Thr Met Glu Pro 370 375 380 Leu Ser Ala Ser Ala GlnGly Ile Gln Leu His Tyr Gly Gln Gly Phe 385 390 395 400 Gln Val Val AlaGly Thr Leu Gln Leu Leu Thr Asn Pro Pro Ile Val 405 410 415 Val Ser SerArg Gly Phe Thr Leu Leu Tyr Thr Pro Ala Phe Thr Val 420 425 430 Ser AsnAsn Met Leu Gly Leu Asn Val Asp Gly Thr Asp Cys Val Ala 435 440 445 IleSer Ser Ala Gly Leu Gln Ile Arg Lys Glu Ala Pro Leu Tyr Val 450 455 460Thr Ser Gly Ser Thr Pro Ala Leu Ala Leu Lys Tyr Ser Ser Asp Phe 465 470475 480 Thr Ile Thr Asn Gly Ala Leu Ala Leu Ala Asn Ser Gly Gly Gly Gly485 490 495 Ser Ser Thr Pro Glu Val Ala Thr Tyr His Cys Gly Asp Asn LeuLeu 500 505 510 Glu Ser Tyr Asp Ile Phe Ala Ser Leu Pro Asn Thr Asn AlaAla Lys 515 520 525 Val Ala Ala Tyr Cys Arg Leu Ala Ala Ala Gly Gly ValVal Ser Gly 530 535 540 Thr Ile Gln Val Thr Ser Tyr Ala Gly Arg Trp ProLys Val Gly Asn 545 550 555 560 Ser Val Thr Asp Gly Ile Lys Phe Ala IleVal Val Ser Pro Pro Met 565 570 575 Asp Lys Asp Pro Arg Ser Asn Leu SerGln Trp Leu Gly Ala Thr Val 580 585 590 Phe Pro Ala Gly Ala Thr Thr AlaLeu Phe Ser Pro Asn Pro Tyr Gly 595 600 605 Ser Leu Asn Thr Ile Thr ThrLeu Pro Ser Ile Ala Ser Asp Trp Tyr 610 615 620 Val Pro Glu Ser Asn LeuVal Thr Tyr Thr Lys Ile His Phe Lys Pro 625 630 635 640 Thr Gly Ser GlnGln Leu Gln Leu Ala Ser Gly Glu Leu Val Val Ala 645 650 655 Ala Ala LysSer Pro Val Gln Thr Thr Lys Tyr Glu Leu Ile Tyr Leu 660 665 670 Gly PheThr Leu Lys Gln Asn Ser Ser Gly Thr Asn Phe Phe Asp Pro 675 680 685 AsnAla Ser Ser Asp Leu Ser Phe Leu Thr Pro Pro Ile Pro Phe Thr 690 695 700Tyr Leu Gly Tyr Tyr Gln 705 710 <210> SEQ ID NO 17 <211> LENGTH: 410<212> TYPE: PRT <213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHERINFORMATION: Position: 30536..31768 /gene: L5 /product: L5 fibre 2 <400>SEQUENCE: 17 Met Ala Asp Gln Lys Arg Lys Leu Ala Asp Pro Asp Ala Glu AlaPro 1 5 10 15 Thr Gly Lys Met Ala Arg Ala Gly Pro Gly Glu Leu Asp LeuVal Tyr 20 25 30 Pro Phe Trp Tyr Gln Val Ala Ala Pro Thr Glu Ile Thr ProPro Phe 35 40 45 Leu Asp Pro Asn Gly Pro Leu Tyr Ser Thr Asp Gly Leu LeuAsn Val 50 55 60 Arg Leu Thr Ala Pro Leu Val Ile Ile Arg Gln Ser Asn GlyAsn Ala 65 70 75 80 Ile Gly Val Lys Thr Asp Gly Ser Ile Thr Val Asn AlaAsp Gly Ala 85 90 95 Leu Gln Ile Gly Ile Ser Thr Ala Gly Pro Leu Thr ThrThr Ala Asn 100 105 110 Gly Ile Asp Leu Asn Ile Asp Pro Lys Thr Leu ValVal Asp Gly Ser 115 120 125 Ser Gly Lys Asn Val Leu Gly Val Leu Leu LysGly Gln Gly Ala Leu 130 135 140 Gln Ser Ser Ala Gln Gly Ile Gly Val AlaVal Asp Glu Ser Leu Gln 145 150 155 160 Ile Val Asp Asn Thr Leu Glu ValLys Val Asp Ala Ala Gly Pro Leu 165 170 175 Ala Val Thr Ala Ala Gly ValGly Leu Gln Tyr Asp Asn Thr Gln Phe 180 185 190 Lys Val Thr Asn Gly ThrLeu Gln Leu Tyr Gln Ala Pro Thr Ser Ser 195 200 205 Val Ala Ala Phe ThrSer Gly Thr Ile Gly Leu Ser Ser Pro Thr Gly 210 215 220 Asn Phe Val SerSer Ser Asn Asn Pro Phe Asn Gly Ser Tyr Phe Leu 225 230 235 240 Gln GlnIle Asn Thr Met Gly Met Leu Thr Thr Ser Leu Tyr Val Lys 245 250 255 ValAsp Thr Thr Thr Met Gly Thr Arg Pro Thr Gly Ala Val Asn Glu 260 265 270Asn Ala Arg Tyr Phe Thr Val Trp Val Ser Ser Phe Leu Thr Gln Cys 275 280285 Asn Pro Ser Asn Ile Gly Gln Gly Thr Leu Glu Pro Ser Asn Ile Ser 290295 300 Met Thr Ser Phe Glu Pro Ala Arg Asn Pro Ile Ser Pro Pro Val Phe305 310 315 320 Asn Met Asn Gln Asn Ile Pro Tyr Tyr Ala Ser Arg Phe GlyVal Leu 325 330 335 Glu Ser Tyr Arg Pro Ile Phe Thr Gly Ser Leu Asn ThrGly Ser Ile 340 345 350 Asp Val Arg Met Gln Val Thr Pro Val Leu Ala ThrAsn Asn Thr Thr 355 360 365 Tyr Asn Leu Ile Ala Phe Thr Phe Gln Cys AlaSer Ala Gly Leu Phe 370 375 380 Asn Pro Thr Val Asn Gly Thr Val Ala IleGly Pro Val Val His Thr 385 390 395 400 Cys Pro Ala Ala Arg Ala Pro ValThr Val 405 410 <210> SEQ ID NO 18 <211> LENGTH: 148 <212> TYPE: PRT<213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION:Position: 33030..33476 /note=ORF5 <400> SEQUENCE: 18 Met Arg Leu Pro ValLeu Pro Val His Met Lys Tyr Pro Leu Phe Asp 1 5 10 15 Val Ser Val GlyGln Pro Thr Val Thr Gly Glu Val Thr Gln Ser Leu 20 25 30 Arg His Asp ArgPro Gln Phe Arg Arg His His His Ala Glu His Ala 35 40 45 Ser Gln Ala AspGly Leu Gly His Gly Ala Ala Ala Arg Asn Ser Phe 50 55 60 His His Asp GlyGly Arg His Gly His Ala Thr Arg Ile His Glu Asn 65 70 75 80 Asn Arg ArgPro His Lys Arg Asn Arg Arg Arg His Leu Arg Lys Gly 85 90 95 His Leu LysAla His Arg Ala Glu Ser His Leu Tyr Gly Tyr Leu Leu 100 105 110 Arg AsnGln Lys Ala Cys Gly Glu Lys Leu Lys Ile Cys Leu Pro Ala 115 120 125 SerLys His Pro Arg Phe Gln Val Val Leu His Pro Arg Cys Asn Lys 130 135 140Lys Gln Pro Thr 145 <210> SEQ ID NO 19 <211> LENGTH: 104 <212> TYPE: PRT<213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION:Position: 33169..33483 /note=ORF6 <400> SEQUENCE: 19 Met Leu Leu Arg GlnThr Gly Trp Val Thr Ala Leu Leu Arg Glu Ile 1 5 10 15 Val Ser Thr MetMet Val Ala Val Thr Val Thr Arg His Ala Phe Met 20 25 30 Arg Thr Thr GlyAsp Arg Thr Lys Gly Thr Asp Glu Gly Thr Cys Glu 35 40 45 Lys Asp Thr SerLys Leu Ile Glu Arg Arg Ala Ile Phe Thr Asp Ile 50 55 60 Phe Ser Ala IleArg Lys Pro Val Val Lys Asn Leu Lys Ser Val Phe 65 70 75 80 Leu Arg AlaSer Ile His Gly Ser Lys Ser Tyr Phe Ile Pro Gly Val 85 90 95 Ile Lys SerAsn Leu Arg Glu Arg 100 <210> SEQ ID NO 20 <211> LENGTH: 131 <212> TYPE:PRT <213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION:Position: 35629..36024 /note= ORF7 <400> SEQUENCE: 20 Met Asn Ser MetVal Leu Glu Leu Arg Lys Lys Met Ser Ser Gly Pro 1 5 10 15 Asp Cys ValIle Gly Arg Pro Pro His Ile Leu Pro Pro Gln Lys Gly 20 25 30 Val Tyr LeuLeu Thr Asn Ile Ser Gln Leu Ile Gly Pro Val Gln Gln 35 40 45 Asn Asp ArgGly Leu Trp Arg His Asn Gly Met His Thr Gln Asn Leu 50 55 60 Ser His HisPhe Thr Gly Pro Phe Ile Cys Ala Val Ile Ala Arg Pro 65 70 75 80 Ile AsnLys Arg Thr His Ile Gly Ile His Val Leu Asn Gln Asn Asn 85 90 95 Glu LeuPro Ala Ile Phe Thr Ile Gln Tyr Pro Glu Pro Pro His Leu 100 105 110 ThrAsp Asn Pro Gly Ala Val Arg Lys Ser Gln Lys Ser Leu Ile Pro 115 120 125Pro Tyr Asn 130 <210> SEQ ID NO 21 <211> LENGTH: 282 <212> TYPE: PRT<213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION:Position: 37391..38239 /note=ORF8 <400> SEQUENCE: 21 Met Ala Arg Asn ProPhe Arg Met Phe Pro Gly Asp Leu Pro Tyr Tyr 1 5 10 15 Met Gly Thr IleSer Phe Thr Ser Val Val Pro Val Asp Pro Ser Gln 20 25 30 Arg Asn Pro ThrThr Ser Leu Arg Glu Met Val Thr Thr Gly Leu Ile 35 40 45 Phe Asn Pro AsnLeu Thr Gly Glu Gln Leu Arg Glu Tyr Ser Phe Ser 50 55 60 Pro Leu Val SerMet Gly Arg Lys Ala Ile Phe Ala Asp Tyr Glu Gly 65 70 75 80 Pro Gln ArgIle Ile His Val Thr Ile Arg Gly Arg Ser Ala Glu Pro 85 90 95 Lys Thr ProSer Glu Ala Leu Ile Met Met Glu Lys Ala Val Arg Gly 100 105 110 Ala PheAla Val Pro Asp Trp Val Ala Arg Glu Tyr Ser Asp Pro Leu 115 120 125 ProHis Gly Ile Thr His Val Gly Asp Leu Gly Phe Pro Ile Gly Ser 130 135 140Val His Ala Leu Lys Met Ala Leu Asp Thr Leu Lys Ile His Val Pro 145 150155 160 Arg Gly Val Gly Val Pro Gly Tyr Glu Gly Leu Cys Gly Thr Thr Thr165 170 175 Ile Lys Ala Pro Arg Gln Tyr Arg Leu Leu Thr Thr Gly Val PheThr 180 185 190 Lys Lys Asp Leu Lys Arg Thr Leu Pro Glu Pro Phe Phe SerArg Phe 195 200 205 Phe Asn Gln Thr Pro Glu Val Cys Ala Ile Lys Thr GlyLys Asn Pro 210 215 220 Phe Ser Thr Glu Ile Trp Cys Met Thr Leu Gly GlyAsp Ser Pro Ala 225 230 235 240 Pro Glu Arg Asn Glu Pro Arg Asn Pro HisSer Leu Gln Asp Trp Ala 245 250 255 Arg Leu Gly Val Met Glu Thr Cys LeuArg Met Ser Arg Arg Gly Leu 260 265 270 Gly Ser Arg His His Pro Tyr HisSer Leu 275 280 <210> SEQ ID NO 22 <211> LENGTH: 321 <212> TYPE: PRT<213> ORGANISM: CELO Virus <220> FEATURE: <223> OTHER INFORMATION:Position: 40037..41002 /note=ORF9 <400> SEQUENCE: 22 Met Glu Pro Pro HisAsn Ser Pro Val Pro Phe Ser Ile Ala Lys Met 1 5 10 15 Gly Asn Pro ThrLeu Leu Leu Leu Ser Gly Leu Leu Ser Leu Thr Gln 20 25 30 Ala Ile Ser IleGly Glu His Glu Asn Lys Thr Arg His Val Ile Val 35 40 45 Trp Arg His SerSer Ser His Gln Cys Ser Asp Trp Arg Thr Val Thr 50 55 60 Glu Trp Phe ProPro Gln Lys Gly Asn Pro Val Arg Pro Pro Tyr Thr 65 70 75 80 Gln Arg ValSer Leu Asp Thr Ala Asn Asn Thr Leu Thr Val Lys Pro 85 90 95 Phe Glu ThrAsn Asn Gly Cys Trp Glu Thr Thr Ser Gln Gly Ile Asn 100 105 110 His ProPro Thr Thr Ile Gln Tyr Arg Val Trp Asn Ile Thr Thr Thr 115 120 125 ProThr Ile Gln Thr Ile Asn Ile Thr Lys Ile Thr Val Arg Glu Gly 130 135 140Glu Asp Phe Thr Leu Tyr Gly Pro Val Ser Glu Thr Met Ser Ile Ile 145 150155 160 Glu Trp Glu Phe Ile Lys Asp Val Thr Pro Gln Phe Ile Leu Gln Tyr165 170 175 Tyr Leu Ser Ile Asn Ser Thr Ile Val Tyr Ala Ser Tyr Gln GlyArg 180 185 190 Val Thr Phe Asn Pro Gly Lys Asn Thr Leu Thr Leu Lys GlyAla Lys 195 200 205 Thr Thr Asp Ser Gly Thr Tyr Lys Ser Thr Val Asn LeuAsp Gln Val 210 215 220 Ser Val His Asn Phe Arg Val Gly Val Thr Pro IleGlu Lys Lys Glu 225 230 235 240 Glu Ala Thr Ala Glu Thr Pro Ala Ser LysPro Thr Pro Ile Pro Arg 245 250 255 Val Arg Ala Asp Ala Arg Ser Thr AlaLeu Trp Val Gly Leu Ala Leu 260 265 270 Cys Ile Leu Thr Val Ile Pro AlaLeu Ile Gly Trp Tyr Phe Arg Asp 275 280 285 Arg Leu Cys Val Pro Asp ProIle Ile Glu Leu Glu Ile Pro Gly Gln 290 295 300 Pro His Val Thr Ile HisIle Leu Lys Gly Pro Asp Asp Asp Cys Glu 305 310 315 320 Thr <210> SEQ IDNO 23 <211> LENGTH: 283 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220>FEATURE: <223> OTHER INFORMATION: Position: 41002..41853 /note=ORF10<400> SEQUENCE: 23 Met Ile Asp Lys Arg Asn Lys Lys Ala Val Thr His IleSer Thr Cys 1 5 10 15 Leu Cys His Ser Ser Ile Pro Ile Tyr Gly Asp SerPro Phe Leu Asn 20 25 30 Thr His Arg Ala Ala Met Asp Pro Arg Pro Leu ValLeu Leu Leu Leu 35 40 45 Leu Ala Ser His Ile Ser Thr Phe Arg Gln Met TyrPhe Glu Gly Glu 50 55 60 Thr Ile His Phe Pro Met Gly Ile Tyr Gly Asn GluThr Thr Leu Tyr 65 70 75 80 Met Asn Asp Ile Ile Leu Glu Gly Thr Arg AlaAsn Thr Thr Thr Arg 85 90 95 Thr Ile Ser Leu Thr Thr Thr Lys Lys Asn AlaGly Thr Asn Leu Tyr 100 105 110 Thr Val Ile Ser Glu Thr Gly His Asn AlaThr Tyr Leu Ile Thr Val 115 120 125 Gln Pro Leu Gly Gln Ser Ile His HisAla Tyr Thr Trp Ala Gly Asn 130 135 140 Thr Phe Thr Leu Gln Gly Gln ValPhe Glu His Gly Asn Tyr Thr Arg 145 150 155 160 Trp Val Arg Leu Glu AsnAla Glu Pro Lys Leu Ile Ile Ser Trp Ala 165 170 175 Leu Ser Asn Arg ThrIle Asn Lys Gly Pro Ala Tyr Thr Ala Asn Met 180 185 190 Asp Phe Asp ProGly Asn Asn Thr Leu Thr Leu His Pro Val Leu Ile 195 200 205 Thr Asp AlaGly Ile Phe Gln Cys Val Ile Asp Gln Gln Thr Asn Leu 210 215 220 Thr LeuThr Ile Asn Phe Thr Val Ser Glu Asn Pro Pro Ile Val Ala 225 230 235 240His Leu Asp Ile His Lys Thr Ile Ser Arg Thr Ile Ala Ile Cys Ser 245 250255 Cys Leu Leu Ile Ala Val Ile Ala Val Leu Cys Cys Leu Arg Gln Leu 260265 270 Asn Val Asn Gly Arg Gly Asn Ser Glu Met Ile 275 280 <210> SEQ IDNO 24 <211> LENGTH: 135 <212> TYPE: PRT <213> ORGANISM: CELO Virus <220>FEATURE: <223> OTHER INFORMATION: Position: 41958..42365 /note= ORF11<400> SEQUENCE: 24 Met Leu Leu Leu Thr Val Val Leu Leu Val Gly Val ThrLeu Ala Ala 1 5 10 15 Asp His Pro Thr Leu Tyr Ala Pro Lys Gly Gly SerIle Glu Leu Gly 20 25 30 Val Gly Ala Lys Gln Lys Gly Gln Tyr Lys Phe GluTrp Arg Phe Gly 35 40 45 Asn Leu Lys Ile Val Ile Ala Glu Met Ser Ser ThrAsn Gln Leu Glu 50 55 60 Ile Lys Phe Pro Asp Asn Gly Phe Gln Asn Arg SerGlu Phe Asn Pro 65 70 75 80 Thr Lys His Asn Leu Thr Ile His Asn Ala SerTyr Glu Asp Ser Gly 85 90 95 Thr Tyr Ser Leu His Gln Glu Glu Asn Asp GlyThr Glu His Thr Asp 100 105 110 Asn Phe Lys Val Ile Val Gln Gly Met SerLeu Tyr Thr Tyr Leu Gln 115 120 125 Tyr Ala Leu Ile Ser Pro Ile 130 135<210> SEQ ID NO 25 <211> LENGTH: 1326 <212> TYPE: DNA <213> ORGANISM:CELO VIRUS <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION:(1)..(1326) <400> SEQUENCE: 25 atg gaa aga acc ccg aaa aga gct cat ggcttt cgc agc acc aag cct 48 Met Glu Arg Thr Pro Lys Arg Ala His Gly PheArg Ser Thr Lys Pro 1 5 10 15 gtc aag aga acg gca gaa gtc atg atg gaagag gag gag gaa gaa gtg 96 Val Lys Arg Thr Ala Glu Val Met Met Glu GluGlu Glu Glu Glu Val 20 25 30 gaa gtg gtc gcc ccg ggc cga ggc gcg act cgcaag aag gtc agc cgc 144 Glu Val Val Ala Pro Gly Arg Gly Ala Thr Arg LysLys Val Ser Arg 35 40 45 cgc gag gag tcc cca tcc ccc gta agg cga gtt acccgc cgg cgg gaa 192 Arg Glu Glu Ser Pro Ser Pro Val Arg Arg Val Thr ArgArg Arg Glu 50 55 60 acc gtt gtc gat gac gaa gaa aac gcc agc gac gag gaatcc ccg gag 240 Thr Val Val Asp Asp Glu Glu Asn Ala Ser Asp Glu Glu SerPro Glu 65 70 75 80 gcc cct ctg tca gac ccc gtg gtc tac ggc gcg caa cgcgcc atg gcc 288 Ala Pro Leu Ser Asp Pro Val Val Tyr Gly Ala Gln Arg AlaMet Ala 85 90 95 acc gtc gcc agc atc tgc gaa gct ctc gac cta cag tgg caggga gcc 336 Thr Val Ala Ser Ile Cys Glu Ala Leu Asp Leu Gln Trp Gln GlyAla 100 105 110 agc gtg cgc ccc gac gac tcc att tgg acc aaa atg ggg ggtaca tac 384 Ser Val Arg Pro Asp Asp Ser Ile Trp Thr Lys Met Gly Gly ThrTyr 115 120 125 gtt cgc aaa aag cat ccc gaa ttt cgc ctg acc ttt tct agctac gac 432 Val Arg Lys Lys His Pro Glu Phe Arg Leu Thr Phe Ser Ser TyrAsp 130 135 140 tct ttc aac gct cag gta ggg cgg ttc ctg gca gcc gtt atctac agc 480 Ser Phe Asn Ala Gln Val Gly Arg Phe Leu Ala Ala Val Ile TyrSer 145 150 155 160 cgc gcg ggt ctg gag ccc aag ttc gtg ccc gga ggg gcgcac gtt tgg 528 Arg Ala Gly Leu Glu Pro Lys Phe Val Pro Gly Gly Ala HisVal Trp 165 170 175 cgc cat ggc tgg ttc cca gcg ctc cag gag ccc ttc ccgaaa tgc atg 576 Arg His Gly Trp Phe Pro Ala Leu Gln Glu Pro Phe Pro LysCys Met 180 185 190 cac ggt gtg gac atg gtg acg aaa cct cgt acc gtg gagttg aac ccg 624 His Gly Val Asp Met Val Thr Lys Pro Arg Thr Val Glu LeuAsn Pro 195 200 205 tct agc gag gcg gga aag agg gct ctg gcc gaa cag aacggc gta atc 672 Ser Ser Glu Ala Gly Lys Arg Ala Leu Ala Glu Gln Asn GlyVal Ile 210 215 220 gag aag aac cgg ttt gga cga cag gtg gtg gtg ctc aggttc gac gcg 720 Glu Lys Asn Arg Phe Gly Arg Gln Val Val Val Leu Arg PheAsp Ala 225 230 235 240 aac gcg gtg tgc tac aag gat cag gag cac agc ggcttc cct cat ccc 768 Asn Ala Val Cys Tyr Lys Asp Gln Glu His Ser Gly PhePro His Pro 245 250 255 cac gcg cac ggc agt tgc gcc atg gtc ttt tcc gacgcc gcc aag gcg 816 His Ala His Gly Ser Cys Ala Met Val Phe Ser Asp AlaAla Lys Ala 260 265 270 gtc agc gcg atg cgt cac gac ata gac tgg acg aaggcg ctt tac ccc 864 Val Ser Ala Met Arg His Asp Ile Asp Trp Thr Lys AlaLeu Tyr Pro 275 280 285 aac gcg gac aag cgc cgg gca gag gaa tgt gtc ctcatc tca acc aat 912 Asn Ala Asp Lys Arg Arg Ala Glu Glu Cys Val Leu IleSer Thr Asn 290 295 300 tgc aac tgc aac tac gcc tcc gat cga gcc att tcaggg aga cag ttc 960 Cys Asn Cys Asn Tyr Ala Ser Asp Arg Ala Ile Ser GlyArg Gln Phe 305 310 315 320 tgt aaa atg act cct tat aag ctc aac ggc acagac gac att act cgc 1008 Cys Lys Met Thr Pro Tyr Lys Leu Asn Gly Thr AspAsp Ile Thr Arg 325 330 335 gac atg gtc gag agc agg ccc gat atg aag gctcac aag aaa aac ccg 1056 Asp Met Val Glu Ser Arg Pro Asp Met Lys Ala HisLys Lys Asn Pro 340 345 350 cat acc atg gtg ttc acc tgc tgc aac ccg caggcg gcg tcg ggc gga 1104 His Thr Met Val Phe Thr Cys Cys Asn Pro Gln AlaAla Ser Gly Gly 355 360 365 gca ggc cgc ggt ctg aag aag acc gaa aaa acctgc gcc tgg cgt ctg 1152 Ala Gly Arg Gly Leu Lys Lys Thr Glu Lys Thr CysAla Trp Arg Leu 370 375 380 tcg gcc atg gat ctg cgc tac gcg tac gtc tttgct acg gag ctg ttt 1200 Ser Ala Met Asp Leu Arg Tyr Ala Tyr Val Phe AlaThr Glu Leu Phe 385 390 395 400 act gcc gtg atg ggt tct tca gag ccc acacat gtg cct gag ttc cgt 1248 Thr Ala Val Met Gly Ser Ser Glu Pro Thr HisVal Pro Glu Phe Arg 405 410 415 tgg aac gag tcg tac gcc ttt aaa acg gaggtg ttg gcg cca gtc tcg 1296 Trp Asn Glu Ser Tyr Ala Phe Lys Thr Glu ValLeu Ala Pro Val Ser 420 425 430 ccc atc gcc agt gac gac ccg ttc gct taa1326 Pro Ile Ala Ser Asp Asp Pro Phe Ala 435 440 <210> SEQ ID NO 26<211> LENGTH: 441 <212> TYPE: PRT <213> ORGANISM: CELO VIRUS <220>FEATURE: <223> OTHER INFORMATION: Position: 21899..23224/Product: E2aDBP <400> SEQUENCE: 26 Met Glu Arg Thr Pro Lys Arg Ala His Gly Phe ArgSer Thr Lys Pro 1 5 10 15 Val Lys Arg Thr Ala Glu Val Met Met Glu GluGlu Glu Glu Glu Val 20 25 30 Glu Val Val Ala Pro Gly Arg Gly Ala Thr ArgLys Lys Val Ser Arg 35 40 45 Arg Glu Glu Ser Pro Ser Pro Val Arg Arg ValThr Arg Arg Arg Glu 50 55 60 Thr Val Val Asp Asp Glu Glu Asn Ala Ser AspGlu Glu Ser Pro Glu 65 70 75 80 Ala Pro Leu Ser Asp Pro Val Val Tyr GlyAla Gln Arg Ala Met Ala 85 90 95 Thr Val Ala Ser Ile Cys Glu Ala Leu AspLeu Gln Trp Gln Gly Ala 100 105 110 Ser Val Arg Pro Asp Asp Ser Ile TrpThr Lys Met Gly Gly Thr Tyr 115 120 125 Val Arg Lys Lys His Pro Glu PheArg Leu Thr Phe Ser Ser Tyr Asp 130 135 140 Ser Phe Asn Ala Gln Val GlyArg Phe Leu Ala Ala Val Ile Tyr Ser 145 150 155 160 Arg Ala Gly Leu GluPro Lys Phe Val Pro Gly Gly Ala His Val Trp 165 170 175 Arg His Gly TrpPhe Pro Ala Leu Gln Glu Pro Phe Pro Lys Cys Met 180 185 190 His Gly ValAsp Met Val Thr Lys Pro Arg Thr Val Glu Leu Asn Pro 195 200 205 Ser SerGlu Ala Gly Lys Arg Ala Leu Ala Glu Gln Asn Gly Val Ile 210 215 220 GluLys Asn Arg Phe Gly Arg Gln Val Val Val Leu Arg Phe Asp Ala 225 230 235240 Asn Ala Val Cys Tyr Lys Asp Gln Glu His Ser Gly Phe Pro His Pro 245250 255 His Ala His Gly Ser Cys Ala Met Val Phe Ser Asp Ala Ala Lys Ala260 265 270 Val Ser Ala Met Arg His Asp Ile Asp Trp Thr Lys Ala Leu TyrPro 275 280 285 Asn Ala Asp Lys Arg Arg Ala Glu Glu Cys Val Leu Ile SerThr Asn 290 295 300 Cys Asn Cys Asn Tyr Ala Ser Asp Arg Ala Ile Ser GlyArg Gln Phe 305 310 315 320 Cys Lys Met Thr Pro Tyr Lys Leu Asn Gly ThrAsp Asp Ile Thr Arg 325 330 335 Asp Met Val Glu Ser Arg Pro Asp Met LysAla His Lys Lys Asn Pro 340 345 350 His Thr Met Val Phe Thr Cys Cys AsnPro Gln Ala Ala Ser Gly Gly 355 360 365 Ala Gly Arg Gly Leu Lys Lys ThrGlu Lys Thr Cys Ala Trp Arg Leu 370 375 380 Ser Ala Met Asp Leu Arg TyrAla Tyr Val Phe Ala Thr Glu Leu Phe 385 390 395 400 Thr Ala Val Met GlySer Ser Glu Pro Thr His Val Pro Glu Phe Arg 405 410 415 Trp Asn Glu SerTyr Ala Phe Lys Thr Glu Val Leu Ala Pro Val Ser 420 425 430 Pro Ile AlaSer Asp Asp Pro Phe Ala 435 440 <210> SEQ ID NO 27 <211> LENGTH: 3366<212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221>NAME/KEY: CDS <222> LOCATION: (1)..(3366) <400> SEQUENCE: 27 atg ctc atcgcc aaa aac gtc acc gga gaa tgg gtc tgg atc acc agc 48 Met Leu Ile AlaLys Asn Val Thr Gly Glu Trp Val Trp Ile Thr Ser 1 5 10 15 cgc act ccggtt caa cag tgt ccc acc tgc ggc cga cac tgg gtc aga 96 Arg Thr Pro ValGln Gln Cys Pro Thr Cys Gly Arg His Trp Val Arg 20 25 30 cga cac tcg tgcaac gaa cgc cgc tct gcc ttc tac tac cac gcc gtc 144 Arg His Ser Cys AsnGlu Arg Arg Ser Ala Phe Tyr Tyr His Ala Val 35 40 45 cag gga tcg ggc agcgat ttg tgg cag cac gtc cac ttc tcc tgt cca 192 Gln Gly Ser Gly Ser AspLeu Trp Gln His Val His Phe Ser Cys Pro 50 55 60 gcc caa cac ccc cac atacgt cag ttg tac atc acc tac gat atc gag 240 Ala Gln His Pro His Ile ArgGln Leu Tyr Ile Thr Tyr Asp Ile Glu 65 70 75 80 acg tat acc gtg ttc gaaaag aaa ggc aag cgc atg cat ccg ttt atg 288 Thr Tyr Thr Val Phe Glu LysLys Gly Lys Arg Met His Pro Phe Met 85 90 95 ttg tgc ttc atg ctc agc ggagac ccc cag ctg gtc tcc cgc gcc gaa 336 Leu Cys Phe Met Leu Ser Gly AspPro Gln Leu Val Ser Arg Ala Glu 100 105 110 cgc tta gca cgg cag gac gaccgt ctc aaa gcc ctc gac gaa ggc ttc 384 Arg Leu Ala Arg Gln Asp Asp ArgLeu Lys Ala Leu Asp Glu Gly Phe 115 120 125 tat tgg cta gac agc cat ccgggc gag gtt gcc aga agg ttt cgc aac 432 Tyr Trp Leu Asp Ser His Pro GlyGlu Val Ala Arg Arg Phe Arg Asn 130 135 140 ttc agg tcc cgt ctg caa atagaa ttt gcc caa aat cta gtc gac cgc 480 Phe Arg Ser Arg Leu Gln Ile GluPhe Ala Gln Asn Leu Val Asp Arg 145 150 155 160 tac gcg gct gcc aac cgagat tat tgc gac cag cta gtc aag gac gga 528 Tyr Ala Ala Ala Asn Arg AspTyr Cys Asp Gln Leu Val Lys Asp Gly 165 170 175 aag tac ggc tcc gtt cacaaa ata ccg tac gag ctc ttc gag aaa ccc 576 Lys Tyr Gly Ser Val His LysIle Pro Tyr Glu Leu Phe Glu Lys Pro 180 185 190 acc tcc ccc ctc tcc ctcccg gat aac ttt tat tcc gta gac atc gta 624 Thr Ser Pro Leu Ser Leu ProAsp Asn Phe Tyr Ser Val Asp Ile Val 195 200 205 gtg cta ggt cac aac atatgt aag ttc gat gaa ctc ctc tta gcc acg 672 Val Leu Gly His Asn Ile CysLys Phe Asp Glu Leu Leu Leu Ala Thr 210 215 220 gaa ctc gtc gag cgc agggac cta ttc ccg gaa gcg tgc aaa tgt gat 720 Glu Leu Val Glu Arg Arg AspLeu Phe Pro Glu Ala Cys Lys Cys Asp 225 230 235 240 cga tcc ttc atg cctcgc gtc ggt cgc ctt ctg ttc aat gat atc att 768 Arg Ser Phe Met Pro ArgVal Gly Arg Leu Leu Phe Asn Asp Ile Ile 245 250 255 ttc cgc atg cca aacccc aac tac gtg aag aaa gac gcc tcc cgc gta 816 Phe Arg Met Pro Asn ProAsn Tyr Val Lys Lys Asp Ala Ser Arg Val 260 265 270 gaa cgc tgg tct cgcggg atc gtg tcc cat cag gat gcg cgc tcg gta 864 Glu Arg Trp Ser Arg GlyIle Val Ser His Gln Asp Ala Arg Ser Val 275 280 285 ttt gtg cgg ttc atggtg cga gac act cta cag ctc acc agc ggg gcc 912 Phe Val Arg Phe Met ValArg Asp Thr Leu Gln Leu Thr Ser Gly Ala 290 295 300 aaa ctc tcc aaa gccgcg gca gcc tac gcg cta gac ctc tgc aag gga 960 Lys Leu Ser Lys Ala AlaAla Ala Tyr Ala Leu Asp Leu Cys Lys Gly 305 310 315 320 cat tgc cca tacgag gcc atc aac gaa ttc att tcc acg ggg cgc ttt 1008 His Cys Pro Tyr GluAla Ile Asn Glu Phe Ile Ser Thr Gly Arg Phe 325 330 335 cac gcg gac gccgac ggc ttt cct gtc gaa agg tac tgg gaa gac cca 1056 His Ala Asp Ala AspGly Phe Pro Val Glu Arg Tyr Trp Glu Asp Pro 340 345 350 tcc gtc atc gctgaa cag aag aat cta tgg cag aaa gaa cac ccg ggc 1104 Ser Val Ile Ala GluGln Lys Asn Leu Trp Gln Lys Glu His Pro Gly 355 360 365 cag cag tac gacatc gtc caa gcg tgc ctc gaa tac tgc atg cag gac 1152 Gln Gln Tyr Asp IleVal Gln Ala Cys Leu Glu Tyr Cys Met Gln Asp 370 375 380 gtc cgc gtc acccaa aag ctg gcc cac acg tta cac gac agc tac gac 1200 Val Arg Val Thr GlnLys Leu Ala His Thr Leu His Asp Ser Tyr Asp 385 390 395 400 gcc tat ttccaa cga gaa cta ggg atg gaa ggc cat ttt aac atc ttc 1248 Ala Tyr Phe GlnArg Glu Leu Gly Met Glu Gly His Phe Asn Ile Phe 405 410 415 gtg cgg cccacc atc ccc agc aac act cat gcc ttt tgg aag caa ctt 1296 Val Arg Pro ThrIle Pro Ser Asn Thr His Ala Phe Trp Lys Gln Leu 420 425 430 acc ttt tccaat tac gtc cgc gaa cag cgt gcg aca tgc cct ccc tcc 1344 Thr Phe Ser AsnTyr Val Arg Glu Gln Arg Ala Thr Cys Pro Pro Ser 435 440 445 gtc ccc gaaccc ccc aaa aag aaa ggt cga acc aaa aag aaa aaa caa 1392 Val Pro Glu ProPro Lys Lys Lys Gly Arg Thr Lys Lys Lys Lys Gln 450 455 460 ccc tcc cccgac tac gtg gcc gaa gtc tac gcc ccc cac cgc ccc atg 1440 Pro Ser Pro AspTyr Val Ala Glu Val Tyr Ala Pro His Arg Pro Met 465 470 475 480 ttc aaatac ata cgc caa gcg ctc cgc ggc gga cga tgc tac ccc aac 1488 Phe Lys TyrIle Arg Gln Ala Leu Arg Gly Gly Arg Cys Tyr Pro Asn 485 490 495 gtg ctcgga cct tac ctg aaa ccc gtc tac gtc ttt gac atc tgc ggc 1536 Val Leu GlyPro Tyr Leu Lys Pro Val Tyr Val Phe Asp Ile Cys Gly 500 505 510 atg tacgct tcc gcc ctc acc cac ccc atg ccc cac gga atg ccc cta 1584 Met Tyr AlaSer Ala Leu Thr His Pro Met Pro His Gly Met Pro Leu 515 520 525 gat ccaaaa ttt acc gcg cag cac gtg gag gag ctc aac cgg ctg ctg 1632 Asp Pro LysPhe Thr Ala Gln His Val Glu Glu Leu Asn Arg Leu Leu 530 535 540 acc aacgaa tcg cat ctg agc tac ttc gat gcg cgt atc aag cct tcc 1680 Thr Asn GluSer His Leu Ser Tyr Phe Asp Ala Arg Ile Lys Pro Ser 545 550 555 560 atcctg aaa ata gaa gcc tac cct ccc ccg ccc gaa atg tta gac cca 1728 Ile LeuLys Ile Glu Ala Tyr Pro Pro Pro Pro Glu Met Leu Asp Pro 565 570 575 ctccct ccc atc tgc tcc cgg agg gga ggc aga ctg gtc tgg acc aac 1776 Leu ProPro Ile Cys Ser Arg Arg Gly Gly Arg Leu Val Trp Thr Asn 580 585 590 gaggct ctc tac gac gag gtg gtc acc gtc ata gat atc ctc acg ctg 1824 Glu AlaLeu Tyr Asp Glu Val Val Thr Val Ile Asp Ile Leu Thr Leu 595 600 605 cacaac cgg gga tgg cga gtc cag gtc ctc cat gac gag atg aac att 1872 His AsnArg Gly Trp Arg Val Gln Val Leu His Asp Glu Met Asn Ile 610 615 620 gttttt ccg gaa tgg aaa acg cta tgt gcc gac tac gtc acg aaa aac 1920 Val PhePro Glu Trp Lys Thr Leu Cys Ala Asp Tyr Val Thr Lys Asn 625 630 635 640atc ctc gcc aaa gaa aaa gcc gat cgc gag aag aac gaa gtg att cga 1968 IleLeu Ala Lys Glu Lys Ala Asp Arg Glu Lys Asn Glu Val Ile Arg 645 650 655tcc atc tcc aaa atg ctg agc aac gcg ctg tac ggt gcg ttt gcc acc 2016 SerIle Ser Lys Met Leu Ser Asn Ala Leu Tyr Gly Ala Phe Ala Thr 660 665 670aac atg gac acc acg cgc atc atc ttt gaa cag gac ctc tcg gaa gca 2064 AsnMet Asp Thr Thr Arg Ile Ile Phe Glu Gln Asp Leu Ser Glu Ala 675 680 685gat aag aaa aac atc tac gaa ggc act gaa atc gtc aaa cac gtc acg 2112 AspLys Lys Asn Ile Tyr Glu Gly Thr Glu Ile Val Lys His Val Thr 690 695 700ctg ctc aat gac gac tcg ttc aac gga acg gaa gtc acc ctc gaa aac 2160 LeuLeu Asn Asp Asp Ser Phe Asn Gly Thr Glu Val Thr Leu Glu Asn 705 710 715720 gcg cct aac ccc ttc agt gag gag agt cta cga caa cag ttc cgc tac 2208Ala Pro Asn Pro Phe Ser Glu Glu Ser Leu Arg Gln Gln Phe Arg Tyr 725 730735 gca gac gac ccc gaa cag gaa gag ccc gaa gca gaa gag gat ggg gaa 2256Ala Asp Asp Pro Glu Gln Glu Glu Pro Glu Ala Glu Glu Asp Gly Glu 740 745750 gaa gaa gga gac gac agc gat cgc gag agt gcc cgt aag ccg aaa aac 2304Glu Glu Gly Asp Asp Ser Asp Arg Glu Ser Ala Arg Lys Pro Lys Asn 755 760765 gca ctt acc gaa gac gat cct ctc gtc gcc gta gac ctg gaa gtc gag 2352Ala Leu Thr Glu Asp Asp Pro Leu Val Ala Val Asp Leu Glu Val Glu 770 775780 gcg acc ctc gcg acg ggc cct tat ata ccc gag ggg gag cta agc tcc 2400Ala Thr Leu Ala Thr Gly Pro Tyr Ile Pro Glu Gly Glu Leu Ser Ser 785 790795 800 gcc cac tac gct cgc gct aac gag acc cgg ttt aaa cct atg cgt ctc2448 Ala His Tyr Ala Arg Ala Asn Glu Thr Arg Phe Lys Pro Met Arg Leu 805810 815 ctc gaa gcc aca cca gaa gcc cta acc gtg ctc cat ctg gaa agc ctg2496 Leu Glu Ala Thr Pro Glu Ala Leu Thr Val Leu His Leu Glu Ser Leu 820825 830 gac aag cag gtg gca aac aaa aga tac gcc acg caa atc gcc tgc ttc2544 Asp Lys Gln Val Ala Asn Lys Arg Tyr Ala Thr Gln Ile Ala Cys Phe 835840 845 gtg ctg ggc tgg tcg agg gcc ttc ttc agc gag tgg tgt gac atc ctg2592 Val Leu Gly Trp Ser Arg Ala Phe Phe Ser Glu Trp Cys Asp Ile Leu 850855 860 tac gga ccg gac aga gga gtg cac atc ctg cga agg gag gag ccg cgc2640 Tyr Gly Pro Asp Arg Gly Val His Ile Leu Arg Arg Glu Glu Pro Arg 865870 875 880 agc ctc tat ggc gat acc gac agc ctg ttc gtc aca gaa aca ggctat 2688 Ser Leu Tyr Gly Asp Thr Asp Ser Leu Phe Val Thr Glu Thr Gly Tyr885 890 895 cat cgc atg aaa agc cgc ggc gcg cac cga atc aaa aca gaa tccact 2736 His Arg Met Lys Ser Arg Gly Ala His Arg Ile Lys Thr Glu Ser Thr900 905 910 cga ctg act ttc gat cca gaa aat ccc ggc ctc tac tgg gcc tgcgat 2784 Arg Leu Thr Phe Asp Pro Glu Asn Pro Gly Leu Tyr Trp Ala Cys Asp915 920 925 tgc gac atc aag tgc aaa gcc tgc gga agt gac acg tac agc tcggaa 2832 Cys Asp Ile Lys Cys Lys Ala Cys Gly Ser Asp Thr Tyr Ser Ser Glu930 935 940 acc atc ttc cta gcg cca aaa ctg tac gga ctg aaa aac tca atctgc 2880 Thr Ile Phe Leu Ala Pro Lys Leu Tyr Gly Leu Lys Asn Ser Ile Cys945 950 955 960 gtc aac gaa cag tgc cgc acg gta gga ccc ggg aaa atc agatcg aag 2928 Val Asn Glu Gln Cys Arg Thr Val Gly Pro Gly Lys Ile Arg SerLys 965 970 975 gga cac agg cag tcc gaa ctc atc tac gac acg ctg ctg cgctgt tgg 2976 Gly His Arg Gln Ser Glu Leu Ile Tyr Asp Thr Leu Leu Arg CysTrp 980 985 990 cgt aga cac gag gac gtg caa ttc gga gcg cag agc aac atccca gag 3024 Arg Arg His Glu Asp Val Gln Phe Gly Ala Gln Ser Asn Ile ProGlu 995 1000 1005 cta cac acg cgg aga acc atc ttt aaa acc acg ctt ctgaac aag gtc 3072 Leu His Thr Arg Arg Thr Ile Phe Lys Thr Thr Leu Leu AsnLys Val 1010 1015 1020 agt cgc tac gac cct ttc acc att cac aac gag cagctc acg cga gtg 3120 Ser Arg Tyr Asp Pro Phe Thr Ile His Asn Glu Gln LeuThr Arg Val 1025 1030 1035 1040 ctg cgt ccg tgg aag gac ctc acc cta tacgag cac ggg gac tac ctg 3168 Leu Arg Pro Trp Lys Asp Leu Thr Leu Tyr GluHis Gly Asp Tyr Leu 1045 1050 1055 tac ccc tac gac aat gag cac cca aatccc cgc acg aca gga gac gta 3216 Tyr Pro Tyr Asp Asn Glu His Pro Asn ProArg Thr Thr Gly Asp Val 1060 1065 1070 cga ccc gtc cca atc gtc ggg cacgaa gac ccc ctc gca ccc cta cga 3264 Arg Pro Val Pro Ile Val Gly His GluAsp Pro Leu Ala Pro Leu Arg 1075 1080 1085 tgg gaa ccc tac gcg ttc ctatcc gaa gag gaa tgc ggg caa gtt cac 3312 Trp Glu Pro Tyr Ala Phe Leu SerGlu Glu Glu Cys Gly Gln Val His 1090 1095 1100 gac cta ctc ttc gca gatgat agc tcc cag gaa gcg gaa agc ctg gga 3360 Asp Leu Leu Phe Ala Asp AspSer Ser Gln Glu Ala Glu Ser Leu Gly 1105 1110 1115 1120 gta tga 3366 Val<210> SEQ ID NO 28 <211> LENGTH: 1121 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION: Position:6501..9866/Product: E2b pol <400> SEQUENCE: 28 Met Leu Ile Ala Lys AsnVal Thr Gly Glu Trp Val Trp Ile Thr Ser 1 5 10 15 Arg Thr Pro Val GlnGln Cys Pro Thr Cys Gly Arg His Trp Val Arg 20 25 30 Arg His Ser Cys AsnGlu Arg Arg Ser Ala Phe Tyr Tyr His Ala Val 35 40 45 Gln Gly Ser Gly SerAsp Leu Trp Gln His Val His Phe Ser Cys Pro 50 55 60 Ala Gln His Pro HisIle Arg Gln Leu Tyr Ile Thr Tyr Asp Ile Glu 65 70 75 80 Thr Tyr Thr ValPhe Glu Lys Lys Gly Lys Arg Met His Pro Phe Met 85 90 95 Leu Cys Phe MetLeu Ser Gly Asp Pro Gln Leu Val Ser Arg Ala Glu 100 105 110 Arg Leu AlaArg Gln Asp Asp Arg Leu Lys Ala Leu Asp Glu Gly Phe 115 120 125 Tyr TrpLeu Asp Ser His Pro Gly Glu Val Ala Arg Arg Phe Arg Asn 130 135 140 PheArg Ser Arg Leu Gln Ile Glu Phe Ala Gln Asn Leu Val Asp Arg 145 150 155160 Tyr Ala Ala Ala Asn Arg Asp Tyr Cys Asp Gln Leu Val Lys Asp Gly 165170 175 Lys Tyr Gly Ser Val His Lys Ile Pro Tyr Glu Leu Phe Glu Lys Pro180 185 190 Thr Ser Pro Leu Ser Leu Pro Asp Asn Phe Tyr Ser Val Asp IleVal 195 200 205 Val Leu Gly His Asn Ile Cys Lys Phe Asp Glu Leu Leu LeuAla Thr 210 215 220 Glu Leu Val Glu Arg Arg Asp Leu Phe Pro Glu Ala CysLys Cys Asp 225 230 235 240 Arg Ser Phe Met Pro Arg Val Gly Arg Leu LeuPhe Asn Asp Ile Ile 245 250 255 Phe Arg Met Pro Asn Pro Asn Tyr Val LysLys Asp Ala Ser Arg Val 260 265 270 Glu Arg Trp Ser Arg Gly Ile Val SerHis Gln Asp Ala Arg Ser Val 275 280 285 Phe Val Arg Phe Met Val Arg AspThr Leu Gln Leu Thr Ser Gly Ala 290 295 300 Lys Leu Ser Lys Ala Ala AlaAla Tyr Ala Leu Asp Leu Cys Lys Gly 305 310 315 320 His Cys Pro Tyr GluAla Ile Asn Glu Phe Ile Ser Thr Gly Arg Phe 325 330 335 His Ala Asp AlaAsp Gly Phe Pro Val Glu Arg Tyr Trp Glu Asp Pro 340 345 350 Ser Val IleAla Glu Gln Lys Asn Leu Trp Gln Lys Glu His Pro Gly 355 360 365 Gln GlnTyr Asp Ile Val Gln Ala Cys Leu Glu Tyr Cys Met Gln Asp 370 375 380 ValArg Val Thr Gln Lys Leu Ala His Thr Leu His Asp Ser Tyr Asp 385 390 395400 Ala Tyr Phe Gln Arg Glu Leu Gly Met Glu Gly His Phe Asn Ile Phe 405410 415 Val Arg Pro Thr Ile Pro Ser Asn Thr His Ala Phe Trp Lys Gln Leu420 425 430 Thr Phe Ser Asn Tyr Val Arg Glu Gln Arg Ala Thr Cys Pro ProSer 435 440 445 Val Pro Glu Pro Pro Lys Lys Lys Gly Arg Thr Lys Lys LysLys Gln 450 455 460 Pro Ser Pro Asp Tyr Val Ala Glu Val Tyr Ala Pro HisArg Pro Met 465 470 475 480 Phe Lys Tyr Ile Arg Gln Ala Leu Arg Gly GlyArg Cys Tyr Pro Asn 485 490 495 Val Leu Gly Pro Tyr Leu Lys Pro Val TyrVal Phe Asp Ile Cys Gly 500 505 510 Met Tyr Ala Ser Ala Leu Thr His ProMet Pro His Gly Met Pro Leu 515 520 525 Asp Pro Lys Phe Thr Ala Gln HisVal Glu Glu Leu Asn Arg Leu Leu 530 535 540 Thr Asn Glu Ser His Leu SerTyr Phe Asp Ala Arg Ile Lys Pro Ser 545 550 555 560 Ile Leu Lys Ile GluAla Tyr Pro Pro Pro Pro Glu Met Leu Asp Pro 565 570 575 Leu Pro Pro IleCys Ser Arg Arg Gly Gly Arg Leu Val Trp Thr Asn 580 585 590 Glu Ala LeuTyr Asp Glu Val Val Thr Val Ile Asp Ile Leu Thr Leu 595 600 605 His AsnArg Gly Trp Arg Val Gln Val Leu His Asp Glu Met Asn Ile 610 615 620 ValPhe Pro Glu Trp Lys Thr Leu Cys Ala Asp Tyr Val Thr Lys Asn 625 630 635640 Ile Leu Ala Lys Glu Lys Ala Asp Arg Glu Lys Asn Glu Val Ile Arg 645650 655 Ser Ile Ser Lys Met Leu Ser Asn Ala Leu Tyr Gly Ala Phe Ala Thr660 665 670 Asn Met Asp Thr Thr Arg Ile Ile Phe Glu Gln Asp Leu Ser GluAla 675 680 685 Asp Lys Lys Asn Ile Tyr Glu Gly Thr Glu Ile Val Lys HisVal Thr 690 695 700 Leu Leu Asn Asp Asp Ser Phe Asn Gly Thr Glu Val ThrLeu Glu Asn 705 710 715 720 Ala Pro Asn Pro Phe Ser Glu Glu Ser Leu ArgGln Gln Phe Arg Tyr 725 730 735 Ala Asp Asp Pro Glu Gln Glu Glu Pro GluAla Glu Glu Asp Gly Glu 740 745 750 Glu Glu Gly Asp Asp Ser Asp Arg GluSer Ala Arg Lys Pro Lys Asn 755 760 765 Ala Leu Thr Glu Asp Asp Pro LeuVal Ala Val Asp Leu Glu Val Glu 770 775 780 Ala Thr Leu Ala Thr Gly ProTyr Ile Pro Glu Gly Glu Leu Ser Ser 785 790 795 800 Ala His Tyr Ala ArgAla Asn Glu Thr Arg Phe Lys Pro Met Arg Leu 805 810 815 Leu Glu Ala ThrPro Glu Ala Leu Thr Val Leu His Leu Glu Ser Leu 820 825 830 Asp Lys GlnVal Ala Asn Lys Arg Tyr Ala Thr Gln Ile Ala Cys Phe 835 840 845 Val LeuGly Trp Ser Arg Ala Phe Phe Ser Glu Trp Cys Asp Ile Leu 850 855 860 TyrGly Pro Asp Arg Gly Val His Ile Leu Arg Arg Glu Glu Pro Arg 865 870 875880 Ser Leu Tyr Gly Asp Thr Asp Ser Leu Phe Val Thr Glu Thr Gly Tyr 885890 895 His Arg Met Lys Ser Arg Gly Ala His Arg Ile Lys Thr Glu Ser Thr900 905 910 Arg Leu Thr Phe Asp Pro Glu Asn Pro Gly Leu Tyr Trp Ala CysAsp 915 920 925 Cys Asp Ile Lys Cys Lys Ala Cys Gly Ser Asp Thr Tyr SerSer Glu 930 935 940 Thr Ile Phe Leu Ala Pro Lys Leu Tyr Gly Leu Lys AsnSer Ile Cys 945 950 955 960 Val Asn Glu Gln Cys Arg Thr Val Gly Pro GlyLys Ile Arg Ser Lys 965 970 975 Gly His Arg Gln Ser Glu Leu Ile Tyr AspThr Leu Leu Arg Cys Trp 980 985 990 Arg Arg His Glu Asp Val Gln Phe GlyAla Gln Ser Asn Ile Pro Glu 995 1000 1005 Leu His Thr Arg Arg Thr IlePhe Lys Thr Thr Leu Leu Asn Lys Val 1010 1015 1020 Ser Arg Tyr Asp ProPhe Thr Ile His Asn Glu Gln Leu Thr Arg Val 1025 1030 1035 1040 Leu ArgPro Trp Lys Asp Leu Thr Leu Tyr Glu His Gly Asp Tyr Leu 1045 1050 1055Tyr Pro Tyr Asp Asn Glu His Pro Asn Pro Arg Thr Thr Gly Asp Val 10601065 1070 Arg Pro Val Pro Ile Val Gly His Glu Asp Pro Leu Ala Pro LeuArg 1075 1080 1085 Trp Glu Pro Tyr Ala Phe Leu Ser Glu Glu Glu Cys GlyGln Val His 1090 1095 1100 Asp Leu Leu Phe Ala Asp Asp Ser Ser Gln GluAla Glu Ser Leu Gly 1105 1110 1115 1120 Val <210> SEQ ID NO 29 <211>LENGTH: 1728 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE:<221> NAME/KEY: CDS <222> LOCATION: (1)..(1728) <400> SEQUENCE: 29 atgcaa ctc cga gac ctc gcg ccg cga tcg ccg aac gtc gcc gcg ccg 48 Met GlnLeu Arg Asp Leu Ala Pro Arg Ser Pro Asn Val Ala Ala Pro 1 5 10 15 ccctac aac gga ttg ccg ccg ccg cac ctt ctc ctc ggg tac caa gct 96 Pro TyrAsn Gly Leu Pro Pro Pro His Leu Leu Leu Gly Tyr Gln Ala 20 25 30 atg caccgc gcg ctc aac gat tac ctt ttc gac aac cgc gtt ttt atg 144 Met His ArgAla Leu Asn Asp Tyr Leu Phe Asp Asn Arg Val Phe Met 35 40 45 cag ata ggttac gat agc cca ccc caa aga ccc aga cgc ctc ttt tgg 192 Gln Ile Gly TyrAsp Ser Pro Pro Gln Arg Pro Arg Arg Leu Phe Trp 50 55 60 acc tgt ctg accgac tgc tcc tac gcc gtc aat gta ggg cag tac atg 240 Thr Cys Leu Thr AspCys Ser Tyr Ala Val Asn Val Gly Gln Tyr Met 65 70 75 80 cga ttt ctc gatctc gac aac ttt cac ggt acg ttc acg cag atg cac 288 Arg Phe Leu Asp LeuAsp Asn Phe His Gly Thr Phe Thr Gln Met His 85 90 95 aac gcc gta ctc atggac cgc gtg gcc gcg gac atg ggc cgg gcg cat 336 Asn Ala Val Leu Met AspArg Val Ala Ala Asp Met Gly Arg Ala His 100 105 110 ctg cga ggt agg ggaatc gac gta ggc cgt cac gga caa gtg ttg ccg 384 Leu Arg Gly Arg Gly IleAsp Val Gly Arg His Gly Gln Val Leu Pro 115 120 125 cag ctc gac gcc gaacac cac agc cta ctg tcg ggc aac gga gcg ggt 432 Gln Leu Asp Ala Glu HisHis Ser Leu Leu Ser Gly Asn Gly Ala Gly 130 135 140 ggc ttg caa gaa ggcgtc ctc atg cga acg gcc tct gcc gcc gac gcc 480 Gly Leu Gln Glu Gly ValLeu Met Arg Thr Ala Ser Ala Ala Asp Ala 145 150 155 160 gaa ctg ctc gccgcc atc cgc caa cta aga gtc gcc ctc tgc cac tat 528 Glu Leu Leu Ala AlaIle Arg Gln Leu Arg Val Ala Leu Cys His Tyr 165 170 175 cta ttc tgc tacgca tat gat cta ttt caa acg gaa gaa aga tat cgg 576 Leu Phe Cys Tyr AlaTyr Asp Leu Phe Gln Thr Glu Glu Arg Tyr Arg 180 185 190 ttc tta cct ggatcc gat gtg ttc ctt gaa cca aac tgg ctc tcc tac 624 Phe Leu Pro Gly SerAsp Val Phe Leu Glu Pro Asn Trp Leu Ser Tyr 195 200 205 ttc gcg gaa gccttc gcg gag cta gac acc cag caa ctg gtg cgg gat 672 Phe Ala Glu Ala PheAla Glu Leu Asp Thr Gln Gln Leu Val Arg Asp 210 215 220 gcc gag cgc aagttt cga gga aga cgg gac gta gag gaa cct acg gaa 720 Ala Glu Arg Lys PheArg Gly Arg Arg Asp Val Glu Glu Pro Thr Glu 225 230 235 240 aca atg gcgaga tgt ttc atg agc act cta gcg agc gac gcc gtt tcc 768 Thr Met Ala ArgCys Phe Met Ser Thr Leu Ala Ser Asp Ala Val Ser 245 250 255 tta gca ggaacg ggt ctg tca gga ggc gcc atc acc ctc tgc agc cgg 816 Leu Ala Gly ThrGly Leu Ser Gly Gly Ala Ile Thr Leu Cys Ser Arg 260 265 270 cgg gta accgac cgc acc ggc ctg cgc cct aga gac cgc cac ggc aga 864 Arg Val Thr AspArg Thr Gly Leu Arg Pro Arg Asp Arg His Gly Arg 275 280 285 gcc atc accgcg tcc gaa gcg cgc cgc att agg ccc cgt gcc gtg cgg 912 Ala Ile Thr AlaSer Glu Ala Arg Arg Ile Arg Pro Arg Ala Val Arg 290 295 300 gcc ttc gtagac cgc ctg ccc cgc gtc acg cgg cgg cga cgg aga ccc 960 Ala Phe Val AspArg Leu Pro Arg Val Thr Arg Arg Arg Arg Arg Pro 305 310 315 320 ccc tccccc gcg ccc cct ccc gaa gaa ata gaa gaa gcc gcc atg gaa 1008 Pro Ser ProAla Pro Pro Pro Glu Glu Ile Glu Glu Ala Ala Met Glu 325 330 335 gta gaagaa cca gaa gag gag gaa gaa gag ctg tta gac gag gtg att 1056 Val Glu GluPro Glu Glu Glu Glu Glu Glu Leu Leu Asp Glu Val Ile 340 345 350 cgc acagcg ctc gaa gcc atc ggg gca ctg caa gac gag ctc agc ggg 1104 Arg Thr AlaLeu Glu Ala Ile Gly Ala Leu Gln Asp Glu Leu Ser Gly 355 360 365 gcc gcccgg aga cac gaa ctc ttc agg ttt gcc aac gac ttc tac cgc 1152 Ala Ala ArgArg His Glu Leu Phe Arg Phe Ala Asn Asp Phe Tyr Arg 370 375 380 atg ctcctg acc gcg cgc gac gcg gga ctc atg gga gag tcg ttc ctg 1200 Met Leu LeuThr Ala Arg Asp Ala Gly Leu Met Gly Glu Ser Phe Leu 385 390 395 400 cgcaag tgg gtg ctg tac ttc ttc tta gcc gaa cat atc gcc tct aca 1248 Arg LysTrp Val Leu Tyr Phe Phe Leu Ala Glu His Ile Ala Ser Thr 405 410 415 ctctac tac ctg tac agc cac ttc atc gcc aac cgc gag ttc cgc cgg 1296 Leu TyrTyr Leu Tyr Ser His Phe Ile Ala Asn Arg Glu Phe Arg Arg 420 425 430 tacgtc gac gtc ctg acc tta caa gta ctc gtc gtg gga tgg gac gtc 1344 Tyr ValAsp Val Leu Thr Leu Gln Val Leu Val Val Gly Trp Asp Val 435 440 445 aacgcg cag cag gtt ttt aaa cgc ata tgg agc gag caa tcc aac ccc 1392 Asn AlaGln Gln Val Phe Lys Arg Ile Trp Ser Glu Gln Ser Asn Pro 450 455 460 gccacc ata ttc gaa acc ctg tgg gaa cgt ata tta cgc gat ttt ctt 1440 Ala ThrIle Phe Glu Thr Leu Trp Glu Arg Ile Leu Arg Asp Phe Leu 465 470 475 480atg atg gtc gaa cgg acg ggt caa ttc gaa ggc atg gac gat gcg gac 1488 MetMet Val Glu Arg Thr Gly Gln Phe Glu Gly Met Asp Asp Ala Asp 485 490 495caa caa ctg ttt ctc tct gat att caa tac aga gac cgc tcc ggt gac 1536 GlnGln Leu Phe Leu Ser Asp Ile Gln Tyr Arg Asp Arg Ser Gly Asp 500 505 510atc gaa gaa gtg ctg aag cag ctc aac ctc agc gaa gag ctg atc gat 1584 IleGlu Glu Val Leu Lys Gln Leu Asn Leu Ser Glu Glu Leu Ile Asp 515 520 525agc atc gac atc agt ttc cgc atc aaa ttt aaa ggc atc gta gcc atc 1632 SerIle Asp Ile Ser Phe Arg Ile Lys Phe Lys Gly Ile Val Ala Ile 530 535 540gct acc aac gag gag atc aaa gcc aac ctc aga cgc gtg ctc cgc cac 1680 AlaThr Asn Glu Glu Ile Lys Ala Asn Leu Arg Arg Val Leu Arg His 545 550 555560 cgc cgc gaa gac atc gaa gcg gcg gcg cga cga ggt cag cct ctg taa 1728Arg Arg Glu Asp Ile Glu Ala Ala Ala Arg Arg Gly Gln Pro Leu 565 570 575<210> SEQ ID NO 30 <211> LENGTH: 575 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION: Position:10269..11996/Product: E2b pTP <400> SEQUENCE: 30 Met Gln Leu Arg Asp LeuAla Pro Arg Ser Pro Asn Val Ala Ala Pro 1 5 10 15 Pro Tyr Asn Gly LeuPro Pro Pro His Leu Leu Leu Gly Tyr Gln Ala 20 25 30 Met His Arg Ala LeuAsn Asp Tyr Leu Phe Asp Asn Arg Val Phe Met 35 40 45 Gln Ile Gly Tyr AspSer Pro Pro Gln Arg Pro Arg Arg Leu Phe Trp 50 55 60 Thr Cys Leu Thr AspCys Ser Tyr Ala Val Asn Val Gly Gln Tyr Met 65 70 75 80 Arg Phe Leu AspLeu Asp Asn Phe His Gly Thr Phe Thr Gln Met His 85 90 95 Asn Ala Val LeuMet Asp Arg Val Ala Ala Asp Met Gly Arg Ala His 100 105 110 Leu Arg GlyArg Gly Ile Asp Val Gly Arg His Gly Gln Val Leu Pro 115 120 125 Gln LeuAsp Ala Glu His His Ser Leu Leu Ser Gly Asn Gly Ala Gly 130 135 140 GlyLeu Gln Glu Gly Val Leu Met Arg Thr Ala Ser Ala Ala Asp Ala 145 150 155160 Glu Leu Leu Ala Ala Ile Arg Gln Leu Arg Val Ala Leu Cys His Tyr 165170 175 Leu Phe Cys Tyr Ala Tyr Asp Leu Phe Gln Thr Glu Glu Arg Tyr Arg180 185 190 Phe Leu Pro Gly Ser Asp Val Phe Leu Glu Pro Asn Trp Leu SerTyr 195 200 205 Phe Ala Glu Ala Phe Ala Glu Leu Asp Thr Gln Gln Leu ValArg Asp 210 215 220 Ala Glu Arg Lys Phe Arg Gly Arg Arg Asp Val Glu GluPro Thr Glu 225 230 235 240 Thr Met Ala Arg Cys Phe Met Ser Thr Leu AlaSer Asp Ala Val Ser 245 250 255 Leu Ala Gly Thr Gly Leu Ser Gly Gly AlaIle Thr Leu Cys Ser Arg 260 265 270 Arg Val Thr Asp Arg Thr Gly Leu ArgPro Arg Asp Arg His Gly Arg 275 280 285 Ala Ile Thr Ala Ser Glu Ala ArgArg Ile Arg Pro Arg Ala Val Arg 290 295 300 Ala Phe Val Asp Arg Leu ProArg Val Thr Arg Arg Arg Arg Arg Pro 305 310 315 320 Pro Ser Pro Ala ProPro Pro Glu Glu Ile Glu Glu Ala Ala Met Glu 325 330 335 Val Glu Glu ProGlu Glu Glu Glu Glu Glu Leu Leu Asp Glu Val Ile 340 345 350 Arg Thr AlaLeu Glu Ala Ile Gly Ala Leu Gln Asp Glu Leu Ser Gly 355 360 365 Ala AlaArg Arg His Glu Leu Phe Arg Phe Ala Asn Asp Phe Tyr Arg 370 375 380 MetLeu Leu Thr Ala Arg Asp Ala Gly Leu Met Gly Glu Ser Phe Leu 385 390 395400 Arg Lys Trp Val Leu Tyr Phe Phe Leu Ala Glu His Ile Ala Ser Thr 405410 415 Leu Tyr Tyr Leu Tyr Ser His Phe Ile Ala Asn Arg Glu Phe Arg Arg420 425 430 Tyr Val Asp Val Leu Thr Leu Gln Val Leu Val Val Gly Trp AspVal 435 440 445 Asn Ala Gln Gln Val Phe Lys Arg Ile Trp Ser Glu Gln SerAsn Pro 450 455 460 Ala Thr Ile Phe Glu Thr Leu Trp Glu Arg Ile Leu ArgAsp Phe Leu 465 470 475 480 Met Met Val Glu Arg Thr Gly Gln Phe Glu GlyMet Asp Asp Ala Asp 485 490 495 Gln Gln Leu Phe Leu Ser Asp Ile Gln TyrArg Asp Arg Ser Gly Asp 500 505 510 Ile Glu Glu Val Leu Lys Gln Leu AsnLeu Ser Glu Glu Leu Ile Asp 515 520 525 Ser Ile Asp Ile Ser Phe Arg IleLys Phe Lys Gly Ile Val Ala Ile 530 535 540 Ala Thr Asn Glu Glu Ile LysAla Asn Leu Arg Arg Val Leu Arg His 545 550 555 560 Arg Arg Glu Asp IleGlu Ala Ala Ala Arg Arg Gly Gln Pro Leu 565 570 575 <210> SEQ ID NO 31<211> LENGTH: 1320 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220>FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)..(1320) <400> SEQUENCE:31 atg agc acc caa atc ccc gca cga cag gag acg tac gac ccg tcc caa 48Met Ser Thr Gln Ile Pro Ala Arg Gln Glu Thr Tyr Asp Pro Ser Gln 1 5 1015 tcg tcg ggc acg aag acc ccc tcg cac ccc tac gat ggg aac cct acg 96Ser Ser Gly Thr Lys Thr Pro Ser His Pro Tyr Asp Gly Asn Pro Thr 20 25 30cgt tcc tat ccg aag agg aat gcg ggc aag ttc acg acc tac tct tcg 144 ArgSer Tyr Pro Lys Arg Asn Ala Gly Lys Phe Thr Thr Tyr Ser Ser 35 40 45 cagatg ata gct ccc agg aag cgg aaa gcc tgg gag tat gag gaa gaa 192 Gln MetIle Ala Pro Arg Lys Arg Lys Ala Trp Glu Tyr Glu Glu Glu 50 55 60 gag tacgaa gcc tcg cgg gac ttc tac cag cgc gtc acc agc tgg tac 240 Glu Tyr GluAla Ser Arg Asp Phe Tyr Gln Arg Val Thr Ser Trp Tyr 65 70 75 80 gac ggagct gtc gac cta gca ccg cag ctc ttc cgc gag caa cac ttc 288 Asp Gly AlaVal Asp Leu Ala Pro Gln Leu Phe Arg Glu Gln His Phe 85 90 95 ccc tcc tacgac gag ttc tac agc cta ggg ggc gtt aat gag aag ttt 336 Pro Ser Tyr AspGlu Phe Tyr Ser Leu Gly Gly Val Asn Glu Lys Phe 100 105 110 ctc gaa gcccac gaa gaa gtt aaa gcc cag gaa cag atg gac agt cgc 384 Leu Glu Ala HisGlu Glu Val Lys Ala Gln Glu Gln Met Asp Ser Arg 115 120 125 tac ctc caacac gga cag ctg ccg tcc atc aac atg ggc aag cag ccc 432 Tyr Leu Gln HisGly Gln Leu Pro Ser Ile Asn Met Gly Lys Gln Pro 130 135 140 atc atc ggggtc atc tac gga ccc acc gga tcc ggc aag tcg cat ctg 480 Ile Ile Gly ValIle Tyr Gly Pro Thr Gly Ser Gly Lys Ser His Leu 145 150 155 160 ctg cgggcg ctc atc tcg tgc aac atg ttg gac ccg atc ccc gaa acg 528 Leu Arg AlaLeu Ile Ser Cys Asn Met Leu Asp Pro Ile Pro Glu Thr 165 170 175 gtc atcttc atc act ccg gaa aag aac atg att cca ccc atc gaa cag 576 Val Ile PheIle Thr Pro Glu Lys Asn Met Ile Pro Pro Ile Glu Gln 180 185 190 acg tcctgg aac ctg cag ctg gtc gag gcc aat ttc gac tgc agg gaa 624 Thr Ser TrpAsn Leu Gln Leu Val Glu Ala Asn Phe Asp Cys Arg Glu 195 200 205 gac ggcacc atc gcc cct aag aca agc acg ttc cgt ccc gaa ttt atg 672 Asp Gly ThrIle Ala Pro Lys Thr Ser Thr Phe Arg Pro Glu Phe Met 210 215 220 gag atgact tac gag gag gcc acc gca ccc gaa cat ctc aac atc gac 720 Glu Met ThrTyr Glu Glu Ala Thr Ala Pro Glu His Leu Asn Ile Asp 225 230 235 240 catcca gac aac att tac gtg aaa gtc tcc aag cgg gga ccc gtc gcc 768 His ProAsp Asn Ile Tyr Val Lys Val Ser Lys Arg Gly Pro Val Ala 245 250 255 attatc atg gac gag tgc atg gat aag ctc tgt tca ggc tcc agc gtc 816 Ile IleMet Asp Glu Cys Met Asp Lys Leu Cys Ser Gly Ser Ser Val 260 265 270 tctgtc ctc ttt cac gcc ctt cct tct aag ctc ttt gct cgc tct gcc 864 Ser ValLeu Phe His Ala Leu Pro Ser Lys Leu Phe Ala Arg Ser Ala 275 280 285 cactgt aca gcc ttc tac att ttc gta gtc ttg cac aac atg gca ccg 912 His CysThr Ala Phe Tyr Ile Phe Val Val Leu His Asn Met Ala Pro 290 295 300 cgcacc gcg ata gga aac gtt ccc acc ctc aaa gtg aac gcg aaa atg 960 Arg ThrAla Ile Gly Asn Val Pro Thr Leu Lys Val Asn Ala Lys Met 305 310 315 320cac atc cta tcc tgt cat att ccc caa ttc cag ttc gct agg ttc ctc 1008 HisIle Leu Ser Cys His Ile Pro Gln Phe Gln Phe Ala Arg Phe Leu 325 330 335tat gcg ttc gca cac aac atc tcg aag gac ctc gtt gtc ctt ctc aaa 1056 TyrAla Phe Ala His Asn Ile Ser Lys Asp Leu Val Val Leu Leu Lys 340 345 350gct tac ttt tcc ttc ctg cag cag aac cag cgg ttc agc tgg gtc atg 1104 AlaTyr Phe Ser Phe Leu Gln Gln Asn Gln Arg Phe Ser Trp Val Met 355 360 365tac act ccg gac cca gta tcc gag tcc ttt agg tgg tgc agt ata gat 1152 TyrThr Pro Asp Pro Val Ser Glu Ser Phe Arg Trp Cys Ser Ile Asp 370 375 380cag cag tac tcg atc atc cct ctc aat gtt aac att cag gag aga ttc 1200 GlnGln Tyr Ser Ile Ile Pro Leu Asn Val Asn Ile Gln Glu Arg Phe 385 390 395400 ctg aaa aca gcc aaa tct atc atc aaa ttt agc gaa aca cat aga aag 1248Leu Lys Thr Ala Lys Ser Ile Ile Lys Phe Ser Glu Thr His Arg Lys 405 410415 cag tta gag aga aac ccc aaa cta acc gat ctc gaa aaa ctt tct ccc 1296Gln Leu Glu Arg Asn Pro Lys Leu Thr Asp Leu Glu Lys Leu Ser Pro 420 425430 cca gga acg ttt cag gaa act taa 1320 Pro Gly Thr Phe Gln Glu Thr 435<210> SEQ ID NO 32 <211> LENGTH: 439 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION: Position:5366..6685/Product:IVa2 <400> SEQUENCE: 32 Met Ser Thr Gln Ile Pro AlaArg Gln Glu Thr Tyr Asp Pro Ser Gln 1 5 10 15 Ser Ser Gly Thr Lys ThrPro Ser His Pro Tyr Asp Gly Asn Pro Thr 20 25 30 Arg Ser Tyr Pro Lys ArgAsn Ala Gly Lys Phe Thr Thr Tyr Ser Ser 35 40 45 Gln Met Ile Ala Pro ArgLys Arg Lys Ala Trp Glu Tyr Glu Glu Glu 50 55 60 Glu Tyr Glu Ala Ser ArgAsp Phe Tyr Gln Arg Val Thr Ser Trp Tyr 65 70 75 80 Asp Gly Ala Val AspLeu Ala Pro Gln Leu Phe Arg Glu Gln His Phe 85 90 95 Pro Ser Tyr Asp GluPhe Tyr Ser Leu Gly Gly Val Asn Glu Lys Phe 100 105 110 Leu Glu Ala HisGlu Glu Val Lys Ala Gln Glu Gln Met Asp Ser Arg 115 120 125 Tyr Leu GlnHis Gly Gln Leu Pro Ser Ile Asn Met Gly Lys Gln Pro 130 135 140 Ile IleGly Val Ile Tyr Gly Pro Thr Gly Ser Gly Lys Ser His Leu 145 150 155 160Leu Arg Ala Leu Ile Ser Cys Asn Met Leu Asp Pro Ile Pro Glu Thr 165 170175 Val Ile Phe Ile Thr Pro Glu Lys Asn Met Ile Pro Pro Ile Glu Gln 180185 190 Thr Ser Trp Asn Leu Gln Leu Val Glu Ala Asn Phe Asp Cys Arg Glu195 200 205 Asp Gly Thr Ile Ala Pro Lys Thr Ser Thr Phe Arg Pro Glu PheMet 210 215 220 Glu Met Thr Tyr Glu Glu Ala Thr Ala Pro Glu His Leu AsnIle Asp 225 230 235 240 His Pro Asp Asn Ile Tyr Val Lys Val Ser Lys ArgGly Pro Val Ala 245 250 255 Ile Ile Met Asp Glu Cys Met Asp Lys Leu CysSer Gly Ser Ser Val 260 265 270 Ser Val Leu Phe His Ala Leu Pro Ser LysLeu Phe Ala Arg Ser Ala 275 280 285 His Cys Thr Ala Phe Tyr Ile Phe ValVal Leu His Asn Met Ala Pro 290 295 300 Arg Thr Ala Ile Gly Asn Val ProThr Leu Lys Val Asn Ala Lys Met 305 310 315 320 His Ile Leu Ser Cys HisIle Pro Gln Phe Gln Phe Ala Arg Phe Leu 325 330 335 Tyr Ala Phe Ala HisAsn Ile Ser Lys Asp Leu Val Val Leu Leu Lys 340 345 350 Ala Tyr Phe SerPhe Leu Gln Gln Asn Gln Arg Phe Ser Trp Val Met 355 360 365 Tyr Thr ProAsp Pro Val Ser Glu Ser Phe Arg Trp Cys Ser Ile Asp 370 375 380 Gln GlnTyr Ser Ile Ile Pro Leu Asn Val Asn Ile Gln Glu Arg Phe 385 390 395 400Leu Lys Thr Ala Lys Ser Ile Ile Lys Phe Ser Glu Thr His Arg Lys 405 410415 Gln Leu Glu Arg Asn Pro Lys Leu Thr Asp Leu Glu Lys Leu Ser Pro 420425 430 Pro Gly Thr Phe Gln Glu Thr 435 <210> SEQ ID NO 33 <211> LENGTH:633 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221>NAME/KEY: CDS <222> LOCATION: (1)..(633) <400> SEQUENCE: 33 atg cta gaagcc gaa ggt tac aat gct ccg gta gcc atc tac gcc att 48 Met Leu Glu AlaGlu Gly Tyr Asn Ala Pro Val Ala Ile Tyr Ala Ile 1 5 10 15 tat ctg tggatg tct gcc atg agc att agt cgc ctg tgc cat tat act 96 Tyr Leu Trp MetSer Ala Met Ser Ile Ser Arg Leu Cys His Tyr Thr 20 25 30 aac acg ctc tatgtc gta gga gaa cct tcc tct gcc gca gat ata ttc 144 Asn Thr Leu Tyr ValVal Gly Glu Pro Ser Ser Ala Ala Asp Ile Phe 35 40 45 act gca tcc atc ctcaga tta ttc caa ttt gtc ctc act gcc aac att 192 Thr Ala Ser Ile Leu ArgLeu Phe Gln Phe Val Leu Thr Ala Asn Ile 50 55 60 aac gcg ttc gac ttt ggccag tac gcc aga cag caa gat tta gtc aag 240 Asn Ala Phe Asp Phe Gly GlnTyr Ala Arg Gln Gln Asp Leu Val Lys 65 70 75 80 atg ctt tat ttc ccc tgcaca gct cat tgt aac acg ttc aaa gat ccc 288 Met Leu Tyr Phe Pro Cys ThrAla His Cys Asn Thr Phe Lys Asp Pro 85 90 95 gtt gct aac cag ctg ctg aaaggc agg tca ttc acc aca atg acc cgc 336 Val Ala Asn Gln Leu Leu Lys GlyArg Ser Phe Thr Thr Met Thr Arg 100 105 110 gac ggt ctc gtg gac atc agtgag aaa aaa tgc ctc gtc cgc tta tat 384 Asp Gly Leu Val Asp Ile Ser GluLys Lys Cys Leu Val Arg Leu Tyr 115 120 125 cag ctc ccc cat ccc gaa catctg ccc act gct ccc gac gaa cat atc 432 Gln Leu Pro His Pro Glu His LeuPro Thr Ala Pro Asp Glu His Ile 130 135 140 att att agg ttc tac gaa cccgcc aac ggc tgc ggg ttc ttt ctg gga 480 Ile Ile Arg Phe Tyr Glu Pro AlaAsn Gly Cys Gly Phe Phe Leu Gly 145 150 155 160 gag ctc tcc cgc tac attcat cgc ata cac caa tta cag gca gat aat 528 Glu Leu Ser Arg Tyr Ile HisArg Ile His Gln Leu Gln Ala Asp Asn 165 170 175 gac aac gac gcc ttg cgcgct ctc cta tgc gag aac aaa gga atg ctc 576 Asp Asn Asp Ala Leu Arg AlaLeu Leu Cys Glu Asn Lys Gly Met Leu 180 185 190 tgt tcc cgc tcg tgg acctcc cca tgc aat gct tgt cac tca tca cat 624 Cys Ser Arg Ser Trp Thr SerPro Cys Asn Ala Cys His Ser Ser His 195 200 205 gac ata taa 633 Asp Ile210 <210> SEQ ID NO 34 <211> LENGTH: 210 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION: Position:4462..5094/note=ORF12 <400> SEQUENCE: 34 Met Leu Glu Ala Glu Gly Tyr AsnAla Pro Val Ala Ile Tyr Ala Ile 1 5 10 15 Tyr Leu Trp Met Ser Ala MetSer Ile Ser Arg Leu Cys His Tyr Thr 20 25 30 Asn Thr Leu Tyr Val Val GlyGlu Pro Ser Ser Ala Ala Asp Ile Phe 35 40 45 Thr Ala Ser Ile Leu Arg LeuPhe Gln Phe Val Leu Thr Ala Asn Ile 50 55 60 Asn Ala Phe Asp Phe Gly GlnTyr Ala Arg Gln Gln Asp Leu Val Lys 65 70 75 80 Met Leu Tyr Phe Pro CysThr Ala His Cys Asn Thr Phe Lys Asp Pro 85 90 95 Val Ala Asn Gln Leu LeuLys Gly Arg Ser Phe Thr Thr Met Thr Arg 100 105 110 Asp Gly Leu Val AspIle Ser Glu Lys Lys Cys Leu Val Arg Leu Tyr 115 120 125 Gln Leu Pro HisPro Glu His Leu Pro Thr Ala Pro Asp Glu His Ile 130 135 140 Ile Ile ArgPhe Tyr Glu Pro Ala Asn Gly Cys Gly Phe Phe Leu Gly 145 150 155 160 GluLeu Ser Arg Tyr Ile His Arg Ile His Gln Leu Gln Ala Asp Asn 165 170 175Asp Asn Asp Ala Leu Arg Ala Leu Leu Cys Glu Asn Lys Gly Met Leu 180 185190 Cys Ser Arg Ser Trp Thr Ser Pro Cys Asn Ala Cys His Ser Ser His 195200 205 Asp Ile 210 <210> SEQ ID NO 35 <211> LENGTH: 1020 <212> TYPE:DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221> NAME/KEY: CDS <222>LOCATION: (1)..(1020) <400> SEQUENCE: 35 atg aca acg acg cct tgc gcg ctctcc tat gcg aga aca aag gaa tgc 48 Met Thr Thr Thr Pro Cys Ala Leu SerTyr Ala Arg Thr Lys Glu Cys 1 5 10 15 tct gtt ccc gct cgt gga cct ccccat gca atg ctt gtc act cat cac 96 Ser Val Pro Ala Arg Gly Pro Pro HisAla Met Leu Val Thr His His 20 25 30 atg aca tat aat tct ctc cca cag tgcacc aag agg cga cgc gag tct 144 Met Thr Tyr Asn Ser Leu Pro Gln Cys ThrLys Arg Arg Arg Glu Ser 35 40 45 cag tcg tct tta agt agc gaa gag gag caaata gca tcc tgc att cca 192 Gln Ser Ser Leu Ser Ser Glu Glu Glu Gln IleAla Ser Cys Ile Pro 50 55 60 gac acc cct tca ccc tgc tta ttt ccg tcc acgtcc ccc atg gat cag 240 Asp Thr Pro Ser Pro Cys Leu Phe Pro Ser Thr SerPro Met Asp Gln 65 70 75 80 ttg gtt gaa cgg ttg ttt gtc gaa ggt gta gcacac gaa gtc cag tgg 288 Leu Val Glu Arg Leu Phe Val Glu Gly Val Ala HisGlu Val Gln Trp 85 90 95 aac ttc ccg tcc aag aac ctc ata ccc acc tac gaacga gag cgt gta 336 Asn Phe Pro Ser Lys Asn Leu Ile Pro Thr Tyr Glu ArgGlu Arg Val 100 105 110 ctc gaa gcc ctc aag gaa cgg ttc gga ccc gga cagagc ctc att aac 384 Leu Glu Ala Leu Lys Glu Arg Phe Gly Pro Gly Gln SerLeu Ile Asn 115 120 125 cag tta ccc tcc gaa gag ccc gac acc ctc aag gctgcg ttc tac aac 432 Gln Leu Pro Ser Glu Glu Pro Asp Thr Leu Lys Ala AlaPhe Tyr Asn 130 135 140 gtc tgc gac aac tgg ttc cat cag atg atg gaa gccgaa ggc tac gag 480 Val Cys Asp Asn Trp Phe His Gln Met Met Glu Ala GluGly Tyr Glu 145 150 155 160 gga aaa gtg gca gct aac gcc atc ctc cga tggctc cga gga gaa cta 528 Gly Lys Val Ala Ala Asn Ala Ile Leu Arg Trp LeuArg Gly Glu Leu 165 170 175 aac acc ctc gtg ctc tgc gga gga aga ctt tccaac gcc aag agt ctc 576 Asn Thr Leu Val Leu Cys Gly Gly Arg Leu Ser AsnAla Lys Ser Leu 180 185 190 ttt aat gcc tta tgc gcg tgt ttc ccg ctc gcgatc tcc gac agc cga 624 Phe Asn Ala Leu Cys Ala Cys Phe Pro Leu Ala IleSer Asp Ser Arg 195 200 205 atc aac tcc ata cta tca ctg ggc gaa atc gcaccc cac gcc tct cta 672 Ile Asn Ser Ile Leu Ser Leu Gly Glu Ile Ala ProHis Ala Ser Leu 210 215 220 tac tgt ctg ccc ttc gta gac gag aag ccg gacccg ttg atg ctg cac 720 Tyr Cys Leu Pro Phe Val Asp Glu Lys Pro Asp ProLeu Met Leu His 225 230 235 240 ttt atg gaa ggc aat gct gcc acc tgc aggctg aat aag aaa acg ttc 768 Phe Met Glu Gly Asn Ala Ala Thr Cys Arg LeuAsn Lys Lys Thr Phe 245 250 255 cac atc ccc tcg acc ccc atg cta atc cactgc gcg gac ctc tcg ctc 816 His Ile Pro Ser Thr Pro Met Leu Ile His CysAla Asp Leu Ser Leu 260 265 270 gcc aac gag ttc acg gcg cgg aac acg gtcgtc ttc ttc ctc aca gga 864 Ala Asn Glu Phe Thr Ala Arg Asn Thr Val ValPhe Phe Leu Thr Gly 275 280 285 gac cac acc aag acc cct cca tgc tac cacccg cgc aaa gag cta cgc 912 Asp His Thr Lys Thr Pro Pro Cys Tyr His ProArg Lys Glu Leu Arg 290 295 300 gac ttt gtt gct aat gct gcc gct tgt gcttgc tta atg aca ctg cat 960 Asp Phe Val Ala Asn Ala Ala Ala Cys Ala CysLeu Met Thr Leu His 305 310 315 320 tgc aaa cgc gat aat aaa ctc tgt aacccc tgt ata cgt acc cct ctt 1008 Cys Lys Arg Asp Asn Lys Leu Cys Asn ProCys Ile Arg Thr Pro Leu 325 330 335 caa aat cag taa 1020 Gln Asn Gln<210> SEQ ID NO 36 <211> LENGTH: 339 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION: Position:3549..4568/note=ORF13 <400> SEQUENCE: 36 Met Thr Thr Thr Pro Cys Ala LeuSer Tyr Ala Arg Thr Lys Glu Cys 1 5 10 15 Ser Val Pro Ala Arg Gly ProPro His Ala Met Leu Val Thr His His 20 25 30 Met Thr Tyr Asn Ser Leu ProGln Cys Thr Lys Arg Arg Arg Glu Ser 35 40 45 Gln Ser Ser Leu Ser Ser GluGlu Glu Gln Ile Ala Ser Cys Ile Pro 50 55 60 Asp Thr Pro Ser Pro Cys LeuPhe Pro Ser Thr Ser Pro Met Asp Gln 65 70 75 80 Leu Val Glu Arg Leu PheVal Glu Gly Val Ala His Glu Val Gln Trp 85 90 95 Asn Phe Pro Ser Lys AsnLeu Ile Pro Thr Tyr Glu Arg Glu Arg Val 100 105 110 Leu Glu Ala Leu LysGlu Arg Phe Gly Pro Gly Gln Ser Leu Ile Asn 115 120 125 Gln Leu Pro SerGlu Glu Pro Asp Thr Leu Lys Ala Ala Phe Tyr Asn 130 135 140 Val Cys AspAsn Trp Phe His Gln Met Met Glu Ala Glu Gly Tyr Glu 145 150 155 160 GlyLys Val Ala Ala Asn Ala Ile Leu Arg Trp Leu Arg Gly Glu Leu 165 170 175Asn Thr Leu Val Leu Cys Gly Gly Arg Leu Ser Asn Ala Lys Ser Leu 180 185190 Phe Asn Ala Leu Cys Ala Cys Phe Pro Leu Ala Ile Ser Asp Ser Arg 195200 205 Ile Asn Ser Ile Leu Ser Leu Gly Glu Ile Ala Pro His Ala Ser Leu210 215 220 Tyr Cys Leu Pro Phe Val Asp Glu Lys Pro Asp Pro Leu Met LeuHis 225 230 235 240 Phe Met Glu Gly Asn Ala Ala Thr Cys Arg Leu Asn LysLys Thr Phe 245 250 255 His Ile Pro Ser Thr Pro Met Leu Ile His Cys AlaAsp Leu Ser Leu 260 265 270 Ala Asn Glu Phe Thr Ala Arg Asn Thr Val ValPhe Phe Leu Thr Gly 275 280 285 Asp His Thr Lys Thr Pro Pro Cys Tyr HisPro Arg Lys Glu Leu Arg 290 295 300 Asp Phe Val Ala Asn Ala Ala Ala CysAla Cys Leu Met Thr Leu His 305 310 315 320 Cys Lys Arg Asp Asn Lys LeuCys Asn Pro Cys Ile Arg Thr Pro Leu 325 330 335 Gln Asn Gln <210> SEQ IDNO 37 <211> LENGTH: 483 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220>FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)..(483) <400> SEQUENCE:37 atg tac ccc ttc aag cac tcg ccc cac tgc att acg gac gaa gag tgt 48Met Tyr Pro Phe Lys His Ser Pro His Cys Ile Thr Asp Glu Glu Cys 1 5 1015 gac ctc cag ctc agg tca ttc tgc agc tgg ata aga gtt att gag atg 96Asp Leu Gln Leu Arg Ser Phe Cys Ser Trp Ile Arg Val Ile Glu Met 20 25 30cga tgt acc gac tgg act atc cag tac atc tgc agc tgc gag aca ccc 144 ArgCys Thr Asp Trp Thr Ile Gln Tyr Ile Cys Ser Cys Glu Thr Pro 35 40 45 cgttcc ctc ttt tgt tta tcc ctc atc cga gtg ctt aca gct cac tgg 192 Arg SerLeu Phe Cys Leu Ser Leu Ile Arg Val Leu Thr Ala His Trp 50 55 60 gcc aaaacg gtc gtc aat ttc gtt gct caa cac gac cac cag ccc caa 240 Ala Lys ThrVal Val Asn Phe Val Ala Gln His Asp His Gln Pro Gln 65 70 75 80 ctc cctctt aat ctc atc tta tac aca tat gct act cac tgc agg tta 288 Leu Pro LeuAsn Leu Ile Leu Tyr Thr Tyr Ala Thr His Cys Arg Leu 85 90 95 tgc aac ttgaac cct gcc ctc gaa caa ata tat aca gca gta acc gtt 336 Cys Asn Leu AsnPro Ala Leu Glu Gln Ile Tyr Thr Ala Val Thr Val 100 105 110 gcg cgg cgccaa ggc gcc tac acg cga ctg gaa gga caa aca ctc tat 384 Ala Arg Arg GlnGly Ala Tyr Thr Arg Leu Glu Gly Gln Thr Leu Tyr 115 120 125 gtc tgt cttcca agg gac atc gta aac tat ccc tgc ata gct tgc ttt 432 Val Cys Leu ProArg Asp Ile Val Asn Tyr Pro Cys Ile Ala Cys Phe 130 135 140 tac cac ctgctt ctg cgg ctc cca gtc gca att aac ttc cac gtg ata 480 Tyr His Leu LeuLeu Arg Leu Pro Val Ala Ile Asn Phe His Val Ile 145 150 155 160 tga 483<210> SEQ ID NO 38 <211> LENGTH: 160 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION: Position:2892..3374/note=ORF14 <400> SEQUENCE: 38 Met Tyr Pro Phe Lys His Ser ProHis Cys Ile Thr Asp Glu Glu Cys 1 5 10 15 Asp Leu Gln Leu Arg Ser PheCys Ser Trp Ile Arg Val Ile Glu Met 20 25 30 Arg Cys Thr Asp Trp Thr IleGln Tyr Ile Cys Ser Cys Glu Thr Pro 35 40 45 Arg Ser Leu Phe Cys Leu SerLeu Ile Arg Val Leu Thr Ala His Trp 50 55 60 Ala Lys Thr Val Val Asn PheVal Ala Gln His Asp His Gln Pro Gln 65 70 75 80 Leu Pro Leu Asn Leu IleLeu Tyr Thr Tyr Ala Thr His Cys Arg Leu 85 90 95 Cys Asn Leu Asn Pro AlaLeu Glu Gln Ile Tyr Thr Ala Val Thr Val 100 105 110 Ala Arg Arg Gln GlyAla Tyr Thr Arg Leu Glu Gly Gln Thr Leu Tyr 115 120 125 Val Cys Leu ProArg Asp Ile Val Asn Tyr Pro Cys Ile Ala Cys Phe 130 135 140 Tyr His LeuLeu Leu Arg Leu Pro Val Ala Ile Asn Phe His Val Ile 145 150 155 160<210> SEQ ID NO 39 <211> LENGTH: 324 <212> TYPE: DNA <213> ORGANISM:CELO VIRUS <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)..(324)<400> SEQUENCE: 39 atg gtc gcg agc tgc cac acg ctc aca atc ata cct aaagaa gca cgc 48 Met Val Ala Ser Cys His Thr Leu Thr Ile Ile Pro Lys GluAla Arg 1 5 10 15 agt aac tgt tac aga gcg tac agc agg gcc tct tgc tggtgc tgc ctt 96 Ser Asn Cys Tyr Arg Ala Tyr Ser Arg Ala Ser Cys Trp CysCys Leu 20 25 30 cgc aca gac aat gtc cgt atg tgt cgt cgt ccc cct caa aatctg ctt 144 Arg Thr Asp Asn Val Arg Met Cys Arg Arg Pro Pro Gln Asn LeuLeu 35 40 45 gca agc gta cag cga agt cga ctt cgt cgc aag ggt cct atc aacggg 192 Ala Ser Val Gln Arg Ser Arg Leu Arg Arg Lys Gly Pro Ile Asn Gly50 55 60 aac cag ggg tca gcc att cca aca cag agc gct gat tgc ggt cta cag240 Asn Gln Gly Ser Ala Ile Pro Thr Gln Ser Ala Asp Cys Gly Leu Gln 6570 75 80 cac cca tac ctg tgg acc cga aac ccg acg ccc cgc ggt ctg tct cgc288 His Pro Tyr Leu Trp Thr Arg Asn Pro Thr Pro Arg Gly Leu Ser Arg 8590 95 cta gct gcg tca gtt ccg aca gct ccg gaa cca taa 324 Leu Ala AlaSer Val Pro Thr Ala Pro Glu Pro 100 105 <210> SEQ ID NO 40 <211> LENGTH:107 <212> TYPE: PRT <213> ORGANISM: CELO VIRUS <220> FEATURE: <223>OTHER INFORMATION: Position:1191..1514/note=ORF15 <400> SEQUENCE: 40 MetVal Ala Ser Cys His Thr Leu Thr Ile Ile Pro Lys Glu Ala Arg 1 5 10 15Ser Asn Cys Tyr Arg Ala Tyr Ser Arg Ala Ser Cys Trp Cys Cys Leu 20 25 30Arg Thr Asp Asn Val Arg Met Cys Arg Arg Pro Pro Gln Asn Leu Leu 35 40 45Ala Ser Val Gln Arg Ser Arg Leu Arg Arg Lys Gly Pro Ile Asn Gly 50 55 60Asn Gln Gly Ser Ala Ile Pro Thr Gln Ser Ala Asp Cys Gly Leu Gln 65 70 7580 His Pro Tyr Leu Trp Thr Arg Asn Pro Thr Pro Arg Gly Leu Ser Arg 85 9095 Leu Ala Ala Ser Val Pro Thr Ala Pro Glu Pro 100 105 <210> SEQ ID NO41 <211> LENGTH: 417 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220>FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)..(417) <400> SEQUENCE:41 atg tat tac ttc cac ctc cgc gtg acc ttg atg gag cct aac ttg gcc 48Met Tyr Tyr Phe His Leu Arg Val Thr Leu Met Glu Pro Asn Leu Ala 1 5 1015 gta ttc cat gat ctg aaa ttg acg gtg ata aat gcc tgg gaa agt tta 96Val Phe His Asp Leu Lys Leu Thr Val Ile Asn Ala Trp Glu Ser Leu 20 25 30act gtt gag atg ctg tcc cac tat agt gta gat tac ctg ttc cga ttg 144 ThrVal Glu Met Leu Ser His Tyr Ser Val Asp Tyr Leu Phe Arg Leu 35 40 45 gaggag ttt gcg ggg gta tat tca gct tct att ttt ttg ccc acg cat 192 Glu GluPhe Ala Gly Val Tyr Ser Ala Ser Ile Phe Leu Pro Thr His 50 55 60 aag gttgat tgg act ttc ttg aaa agg gcg gtg gct tta ctg cgc gaa 240 Lys Val AspTrp Thr Phe Leu Lys Arg Ala Val Ala Leu Leu Arg Glu 65 70 75 80 tgt atttgg agg aga ttt gaa tgt aca cag gtt ccg cga ggg gtg gct 288 Cys Ile TrpArg Arg Phe Glu Cys Thr Gln Val Pro Arg Gly Val Ala 85 90 95 tct att tacgcg gtg cgc aat acg tgg acc ccc tcc gcc aat agg gtg 336 Ser Ile Tyr AlaVal Arg Asn Thr Trp Thr Pro Ser Ala Asn Arg Val 100 105 110 gcc cgt cacttt gta aaa cgc ggg gca ttg gtt ggc atg cag ccc tgt 384 Ala Arg His PheVal Lys Arg Gly Ala Leu Val Gly Met Gln Pro Cys 115 120 125 tta cac gaatgt acc tat gag cgg gat gcc tgt 417 Leu His Glu Cys Thr Tyr Glu Arg AspAla Cys 130 135 <210> SEQ ID NO 42 <211> LENGTH: 139 <212> TYPE: PRT<213> ORGANISM: CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION:Position:39286..39705/note=ORF16 <400> SEQUENCE: 42 Met Tyr Tyr Phe HisLeu Arg Val Thr Leu Met Glu Pro Asn Leu Ala 1 5 10 15 Val Phe His AspLeu Lys Leu Thr Val Ile Asn Ala Trp Glu Ser Leu 20 25 30 Thr Val Glu MetLeu Ser His Tyr Ser Val Asp Tyr Leu Phe Arg Leu 35 40 45 Glu Glu Phe AlaGly Val Tyr Ser Ala Ser Ile Phe Leu Pro Thr His 50 55 60 Lys Val Asp TrpThr Phe Leu Lys Arg Ala Val Ala Leu Leu Arg Glu 65 70 75 80 Cys Ile TrpArg Arg Phe Glu Cys Thr Gln Val Pro Arg Gly Val Ala 85 90 95 Ser Ile TyrAla Val Arg Asn Thr Trp Thr Pro Ser Ala Asn Arg Val 100 105 110 Ala ArgHis Phe Val Lys Arg Gly Ala Leu Val Gly Met Gln Pro Cys 115 120 125 LeuHis Glu Cys Thr Tyr Glu Arg Asp Ala Cys 130 135 <210> SEQ ID NO 43 <211>LENGTH: 540 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE:<221> NAME/KEY: CDS <222> LOCATION: (1)..(540) <400> SEQUENCE: 43 atgcct ttg tat ttg tgc ttt ggg gcc gcc gcc ccg gtt tcg att ttg 48 Met ProLeu Tyr Leu Cys Phe Gly Ala Ala Ala Pro Val Ser Ile Leu 1 5 10 15 tggcgg gaa gaa ctc ttc tgg gga ttc gtg gct gcg gtc aag agg agg 96 Trp ArgGlu Glu Leu Phe Trp Gly Phe Val Ala Ala Val Lys Arg Arg 20 25 30 tgg cacact gta tat gca cgg acc aat gtg gac att cag tat ccg atg 144 Trp His ThrVal Tyr Ala Arg Thr Asn Val Asp Ile Gln Tyr Pro Met 35 40 45 gcg tat tgtgtc ggt atc caa tcc ctg tct cca tgc aaa tgt cat gtg 192 Ala Tyr Cys ValGly Ile Gln Ser Leu Ser Pro Cys Lys Cys His Val 50 55 60 acc gtg gtg gtgtgt ctg acc ttt ctg gat ctg cgc atg tcc gct att 240 Thr Val Val Val CysLeu Thr Phe Leu Asp Leu Arg Met Ser Ala Ile 65 70 75 80 aat gaa gcc acgaaa ata atg cgc gcg ttt ttc aaa acc ttt ttc tac 288 Asn Glu Ala Thr LysIle Met Arg Ala Phe Phe Lys Thr Phe Phe Tyr 85 90 95 cac cac ggg aaa gtcccg cgt ggg cgg tgg ttt aaa ttg tac aga aat 336 His His Gly Lys Val ProArg Gly Arg Trp Phe Lys Leu Tyr Arg Asn 100 105 110 gat tgg tgt aag gatcct aat tta aca gtg ggt aat tac att gtg gca 384 Asp Trp Cys Lys Asp ProAsn Leu Thr Val Gly Asn Tyr Ile Val Ala 115 120 125 tcg ggg gcg tta cctttg atg ctg ggg tgg gcg cgg tct acg ggg ttg 432 Ser Gly Ala Leu Pro LeuMet Leu Gly Trp Ala Arg Ser Thr Gly Leu 130 135 140 cgg ttc agc aca tttaca tat tca gat gag gct ctg tgg agt cat aga 480 Arg Phe Ser Thr Phe ThrTyr Ser Asp Glu Ala Leu Trp Ser His Arg 145 150 155 160 cgg aga gat aggagg ctt gcc cgt cgg cgg gaa aag ctt gaa aat aaa 528 Arg Arg Asp Arg ArgLeu Ala Arg Arg Arg Glu Lys Leu Glu Asn Lys 165 170 175 gta tca ggt tga540 Val Ser Gly <210> SEQ ID NO 44 <211> LENGTH: 179 <212> TYPE: PRT<213> ORGANISM: CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION:Position:38717..39256/note=ORF17 <400> SEQUENCE: 44 Met Pro Leu Tyr LeuCys Phe Gly Ala Ala Ala Pro Val Ser Ile Leu 1 5 10 15 Trp Arg Glu GluLeu Phe Trp Gly Phe Val Ala Ala Val Lys Arg Arg 20 25 30 Trp His Thr ValTyr Ala Arg Thr Asn Val Asp Ile Gln Tyr Pro Met 35 40 45 Ala Tyr Cys ValGly Ile Gln Ser Leu Ser Pro Cys Lys Cys His Val 50 55 60 Thr Val Val ValCys Leu Thr Phe Leu Asp Leu Arg Met Ser Ala Ile 65 70 75 80 Asn Glu AlaThr Lys Ile Met Arg Ala Phe Phe Lys Thr Phe Phe Tyr 85 90 95 His His GlyLys Val Pro Arg Gly Arg Trp Phe Lys Leu Tyr Arg Asn 100 105 110 Asp TrpCys Lys Asp Pro Asn Leu Thr Val Gly Asn Tyr Ile Val Ala 115 120 125 SerGly Ala Leu Pro Leu Met Leu Gly Trp Ala Arg Ser Thr Gly Leu 130 135 140Arg Phe Ser Thr Phe Thr Tyr Ser Asp Glu Ala Leu Trp Ser His Arg 145 150155 160 Arg Arg Asp Arg Arg Leu Ala Arg Arg Arg Glu Lys Leu Glu Asn Lys165 170 175 Val Ser Gly <210> SEQ ID NO 45 <211> LENGTH: 609 <212> TYPE:DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221> NAME/KEY: CDS <222>LOCATION: (1)..(609) <400> SEQUENCE: 45 atg tct gcc cta tct tcg tgc tttaat ggg tcg gat tcc aga tgg gat 48 Met Ser Ala Leu Ser Ser Cys Phe AsnGly Ser Asp Ser Arg Trp Asp 1 5 10 15 ccg cca tat cct aag gct gac gtcagg cgc ttg atg ggc acc tat tcg 96 Pro Pro Tyr Pro Lys Ala Asp Val ArgArg Leu Met Gly Thr Tyr Ser 20 25 30 ccg gat ttt cct tcg tgg ccc aag ttaatt gta tgg tgg aat gag act 144 Pro Asp Phe Pro Ser Trp Pro Lys Leu IleVal Trp Trp Asn Glu Thr 35 40 45 ttt ttg act ttt tcg gac ggc ccc tgg gttgtc agt caa atg cgg cgg 192 Phe Leu Thr Phe Ser Asp Gly Pro Trp Val ValSer Gln Met Arg Arg 50 55 60 ctc ggg gta ttg gat ggt aaa gat agc ggg gagctc att att ctg gtt 240 Leu Gly Val Leu Asp Gly Lys Asp Ser Gly Glu LeuIle Ile Leu Val 65 70 75 80 cag gac atg tat ccc gat gtg tgt ccg ctt attaat agg gcg cgc tat 288 Gln Asp Met Tyr Pro Asp Val Cys Pro Leu Ile AsnArg Ala Arg Tyr 85 90 95 gac ggc aca tat aaa tgg acc agt gaa atg atg agaaag att ttg cgt 336 Asp Gly Thr Tyr Lys Trp Thr Ser Glu Met Met Arg LysIle Leu Arg 100 105 110 atg cat acc att atg acg cca gag tcc ccg gtc attctg ttg gac tgg 384 Met His Thr Ile Met Thr Pro Glu Ser Pro Val Ile LeuLeu Asp Trp 115 120 125 acc aat cag ctg aga gat att tgt aag aag gta gacgcc ctt ttg tgg 432 Thr Asn Gln Leu Arg Asp Ile Cys Lys Lys Val Asp AlaLeu Leu Trp 130 135 140 ggg cag gat gtg agg ggg ccg gcc tat tac gca gtcagg acc act gct 480 Gly Gln Asp Val Arg Gly Pro Ala Tyr Tyr Ala Val ArgThr Thr Ala 145 150 155 160 cat ttt ttt acg gag ttc aag gac cat cga attcat tgc ata ggg atg 528 His Phe Phe Thr Glu Phe Lys Asp His Arg Ile HisCys Ile Gly Met 165 170 175 tcg cta ggg ggc act gta tgc gcg gct ttg tcccgc caa ctt cta gtc 576 Ser Leu Gly Gly Thr Val Cys Ala Ala Leu Ser ArgGln Leu Leu Val 180 185 190 cgg aca gag ggt caa aaa agg ttg gcc gca tag609 Arg Thr Glu Gly Gln Lys Arg Leu Ala Ala 195 200 <210> SEQ ID NO 46<211> LENGTH: 202 <212> TYPE: PRT <213> ORGANISM: CELO VIRUS <220>FEATURE: <223> OTHER INFORMATION: Position:35536..36144/note=ORF18 <400>SEQUENCE: 46 Met Ser Ala Leu Ser Ser Cys Phe Asn Gly Ser Asp Ser Arg TrpAsp 1 5 10 15 Pro Pro Tyr Pro Lys Ala Asp Val Arg Arg Leu Met Gly ThrTyr Ser 20 25 30 Pro Asp Phe Pro Ser Trp Pro Lys Leu Ile Val Trp Trp AsnGlu Thr 35 40 45 Phe Leu Thr Phe Ser Asp Gly Pro Trp Val Val Ser Gln MetArg Arg 50 55 60 Leu Gly Val Leu Asp Gly Lys Asp Ser Gly Glu Leu Ile IleLeu Val 65 70 75 80 Gln Asp Met Tyr Pro Asp Val Cys Pro Leu Ile Asn ArgAla Arg Tyr 85 90 95 Asp Gly Thr Tyr Lys Trp Thr Ser Glu Met Met Arg LysIle Leu Arg 100 105 110 Met His Thr Ile Met Thr Pro Glu Ser Pro Val IleLeu Leu Asp Trp 115 120 125 Thr Asn Gln Leu Arg Asp Ile Cys Lys Lys ValAsp Ala Leu Leu Trp 130 135 140 Gly Gln Asp Val Arg Gly Pro Ala Tyr TyrAla Val Arg Thr Thr Ala 145 150 155 160 His Phe Phe Thr Glu Phe Lys AspHis Arg Ile His Cys Ile Gly Met 165 170 175 Ser Leu Gly Gly Thr Val CysAla Ala Leu Ser Arg Gln Leu Leu Val 180 185 190 Arg Thr Glu Gly Gln LysArg Leu Ala Ala 195 200 <210> SEQ ID NO 47 <211> LENGTH: 1362 <212>TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221> NAME/KEY: CDS<222> LOCATION: (1)..(1362) <400> SEQUENCE: 47 atg cgc ggc ttt gtc ccgcca act tct agt ccg gac aga ggg tca aaa 48 Met Arg Gly Phe Val Pro ProThr Ser Ser Pro Asp Arg Gly Ser Lys 1 5 10 15 aag gtt ggc cgc ata gtggcg cta gat ccc cca ttg gaa agt ttc cag 96 Lys Val Gly Arg Ile Val AlaLeu Asp Pro Pro Leu Glu Ser Phe Gln 20 25 30 ggt tat agg atg gac tta cacaca aaa ggg tta aac ctt ttg ctg tct 144 Gly Tyr Arg Met Asp Leu His ThrLys Gly Leu Asn Leu Leu Leu Ser 35 40 45 tcc ggt ggt cat tgg tca gcc aatagg gat gcg gat agt gtt atc tct 192 Ser Gly Gly His Trp Ser Ala Asn ArgAsp Ala Asp Ser Val Ile Ser 50 55 60 agg gat gac gcg gat tac gta gtg gtgatt gcg tct agt att ggt tcg 240 Arg Asp Asp Ala Asp Tyr Val Val Val IleAla Ser Ser Ile Gly Ser 65 70 75 80 tac ggt ttt gat cga cca ata gga gacgag tat ata cgg agc gat ctg 288 Tyr Gly Phe Asp Arg Pro Ile Gly Asp GluTyr Ile Arg Ser Asp Leu 85 90 95 act ggt caa aaa cat gag gct tgc gag tcacgt gct tgg tgg aaa ggt 336 Thr Gly Gln Lys His Glu Ala Cys Glu Ser ArgAla Trp Trp Lys Gly 100 105 110 caa atc tgt gca tgg tct tat tct ggg aggcgc cat tgt gaa gat gta 384 Gln Ile Cys Ala Trp Ser Tyr Ser Gly Arg ArgHis Cys Glu Asp Val 115 120 125 cat att ccc ttt gat ttc ctc cga tcg gatgga ctg tgt tat cac att 432 His Ile Pro Phe Asp Phe Leu Arg Ser Asp GlyLeu Cys Tyr His Ile 130 135 140 atg gcg cct ttg acc ttt atg aag gcc ctggat act cac cag gct gac 480 Met Ala Pro Leu Thr Phe Met Lys Ala Leu AspThr His Gln Ala Asp 145 150 155 160 cag cta ctg agc atg cac gga agt gttcct tcc gcg tgg tca gcc tac 528 Gln Leu Leu Ser Met His Gly Ser Val ProSer Ala Trp Ser Ala Tyr 165 170 175 gtc acg ggg cgt gat tac agt cag ccaact cag tac tac acg gaa gag 576 Val Thr Gly Arg Asp Tyr Ser Gln Pro ThrGln Tyr Tyr Thr Glu Glu 180 185 190 gta gct gat tgg agg atg ctt tta cgagag gat gac atg gca tct tcc 624 Val Ala Asp Trp Arg Met Leu Leu Arg GluAsp Asp Met Ala Ser Ser 195 200 205 tat ttg ctg ttg gtg gtg aca gag ggcaat gcc gcg gag ttg tgg act 672 Tyr Leu Leu Leu Val Val Thr Glu Gly AsnAla Ala Glu Leu Trp Thr 210 215 220 tat gac cct tat tat act aaa aca ataggg atg gaa cac ggg tat tcg 720 Tyr Asp Pro Tyr Tyr Thr Lys Thr Ile GlyMet Glu His Gly Tyr Ser 225 230 235 240 gtc aga tgg tat ttt att agg gatagg aat gtg ggc gag gct ccc att 768 Val Arg Trp Tyr Phe Ile Arg Asp ArgAsn Val Gly Glu Ala Pro Ile 245 250 255 gtt tta tat gct agg ggc ggg ggtgta tta aaa ttt att aga ctg tac 816 Val Leu Tyr Ala Arg Gly Gly Gly ValLeu Lys Phe Ile Arg Leu Tyr 260 265 270 aag ggg cgt ggc act ctg acg tcacta ggg gcg agg gca atg acg aca 864 Lys Gly Arg Gly Thr Leu Thr Ser LeuGly Ala Arg Ala Met Thr Thr 275 280 285 cag gaa gtg acg gag ttt acg tgtttc cgg act cac acc tat tac ttt 912 Gln Glu Val Thr Glu Phe Thr Cys PheArg Thr His Thr Tyr Tyr Phe 290 295 300 acc gga act aag aag tac gat tgccat cca ggc ggg cac cgc ttt gat 960 Thr Gly Thr Lys Lys Tyr Asp Cys HisPro Gly Gly His Arg Phe Asp 305 310 315 320 gtc cct aga tgg cgc tct catatc aat gtt tct gcg cac cat ctt cct 1008 Val Pro Arg Trp Arg Ser His IleAsn Val Ser Ala His His Leu Pro 325 330 335 gtc ccg ccc aaa tgt ggc tgtttg aag ttc ccc aaa ttg ttt aag gat 1056 Val Pro Pro Lys Cys Gly Cys LeuLys Phe Pro Lys Leu Phe Lys Asp 340 345 350 tat gtc ata ttt gat cac ccgaat gta gtg ggc agg gcc gga gaa tat 1104 Tyr Val Ile Phe Asp His Pro AsnVal Val Gly Arg Ala Gly Glu Tyr 355 360 365 gtt agt tta ggg ccc tgg agtacc ggg tta cag gcc gta gtg acc ttt 1152 Val Ser Leu Gly Pro Trp Ser ThrGly Leu Gln Ala Val Val Thr Phe 370 375 380 aaa cct caa cct cga cgt caccga gtg gtc ctg gct acg tac tgg gat 1200 Lys Pro Gln Pro Arg Arg His ArgVal Val Leu Ala Thr Tyr Trp Asp 385 390 395 400 gcc tgt tca aac acc aagagg cgt gtc ggc att gac gtc aga acg gac 1248 Ala Cys Ser Asn Thr Lys ArgArg Val Gly Ile Asp Val Arg Thr Asp 405 410 415 cgg aag aat cac atg gtttgg ctc aag gcg gac aag cct gtg tcc aga 1296 Arg Lys Asn His Met Val TrpLeu Lys Ala Asp Lys Pro Val Ser Arg 420 425 430 gag atg tgg ttt gta tcggaa gtg gac gtc gtt cga gtc tac gtc acg 1344 Glu Met Trp Phe Val Ser GluVal Asp Val Val Arg Val Tyr Val Thr 435 440 445 tgg ctc tcc ccc gaa taa1362 Trp Leu Ser Pro Glu 450 <210> SEQ ID NO 48 <211> LENGTH: 453 <212>TYPE: PRT <213> ORGANISM: CELO VIRUS <220> FEATURE: <223> OTHERINFORMATION: Position:34238..35599/note=ORF19 <400> SEQUENCE: 48 Met ArgGly Phe Val Pro Pro Thr Ser Ser Pro Asp Arg Gly Ser Lys 1 5 10 15 LysVal Gly Arg Ile Val Ala Leu Asp Pro Pro Leu Glu Ser Phe Gln 20 25 30 GlyTyr Arg Met Asp Leu His Thr Lys Gly Leu Asn Leu Leu Leu Ser 35 40 45 SerGly Gly His Trp Ser Ala Asn Arg Asp Ala Asp Ser Val Ile Ser 50 55 60 ArgAsp Asp Ala Asp Tyr Val Val Val Ile Ala Ser Ser Ile Gly Ser 65 70 75 80Tyr Gly Phe Asp Arg Pro Ile Gly Asp Glu Tyr Ile Arg Ser Asp Leu 85 90 95Thr Gly Gln Lys His Glu Ala Cys Glu Ser Arg Ala Trp Trp Lys Gly 100 105110 Gln Ile Cys Ala Trp Ser Tyr Ser Gly Arg Arg His Cys Glu Asp Val 115120 125 His Ile Pro Phe Asp Phe Leu Arg Ser Asp Gly Leu Cys Tyr His Ile130 135 140 Met Ala Pro Leu Thr Phe Met Lys Ala Leu Asp Thr His Gln AlaAsp 145 150 155 160 Gln Leu Leu Ser Met His Gly Ser Val Pro Ser Ala TrpSer Ala Tyr 165 170 175 Val Thr Gly Arg Asp Tyr Ser Gln Pro Thr Gln TyrTyr Thr Glu Glu 180 185 190 Val Ala Asp Trp Arg Met Leu Leu Arg Glu AspAsp Met Ala Ser Ser 195 200 205 Tyr Leu Leu Leu Val Val Thr Glu Gly AsnAla Ala Glu Leu Trp Thr 210 215 220 Tyr Asp Pro Tyr Tyr Thr Lys Thr IleGly Met Glu His Gly Tyr Ser 225 230 235 240 Val Arg Trp Tyr Phe Ile ArgAsp Arg Asn Val Gly Glu Ala Pro Ile 245 250 255 Val Leu Tyr Ala Arg GlyGly Gly Val Leu Lys Phe Ile Arg Leu Tyr 260 265 270 Lys Gly Arg Gly ThrLeu Thr Ser Leu Gly Ala Arg Ala Met Thr Thr 275 280 285 Gln Glu Val ThrGlu Phe Thr Cys Phe Arg Thr His Thr Tyr Tyr Phe 290 295 300 Thr Gly ThrLys Lys Tyr Asp Cys His Pro Gly Gly His Arg Phe Asp 305 310 315 320 ValPro Arg Trp Arg Ser His Ile Asn Val Ser Ala His His Leu Pro 325 330 335Val Pro Pro Lys Cys Gly Cys Leu Lys Phe Pro Lys Leu Phe Lys Asp 340 345350 Tyr Val Ile Phe Asp His Pro Asn Val Val Gly Arg Ala Gly Glu Tyr 355360 365 Val Ser Leu Gly Pro Trp Ser Thr Gly Leu Gln Ala Val Val Thr Phe370 375 380 Lys Pro Gln Pro Arg Arg His Arg Val Val Leu Ala Thr Tyr TrpAsp 385 390 395 400 Ala Cys Ser Asn Thr Lys Arg Arg Val Gly Ile Asp ValArg Thr Asp 405 410 415 Arg Lys Asn His Met Val Trp Leu Lys Ala Asp LysPro Val Ser Arg 420 425 430 Glu Met Trp Phe Val Ser Glu Val Asp Val ValArg Val Tyr Val Thr 435 440 445 Trp Leu Ser Pro Glu 450 <210> SEQ ID NO49 <211> LENGTH: 816 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220>FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)..(816) <400> SEQUENCE:49 atg gag aga ctg aac gag tac cgc ata aat aga gcc gtg gct agc ttg 48Met Glu Arg Leu Asn Glu Tyr Arg Ile Asn Arg Ala Val Ala Ser Leu 1 5 1015 cgg tgt ttc gat aat gat ctg atg agg cga ttg cat agt tct gtc acg 96Arg Cys Phe Asp Asn Asp Leu Met Arg Arg Leu His Ser Ser Val Thr 20 25 30gtg cta gtg acg gta cgc agc gca aag ttt gtg tgt ttc aaa cgg cga 144 ValLeu Val Thr Val Arg Ser Ala Lys Phe Val Cys Phe Lys Arg Arg 35 40 45 gactac gta ctc atg aat tgc ata gtc cgg att gtg agt gcc ctt cac 192 Asp TyrVal Leu Met Asn Cys Ile Val Arg Ile Val Ser Ala Leu His 50 55 60 ctg aaccgg gca gag aag acc gcc ctg ctg cac tac ctc tca cgt agg 240 Leu Asn ArgAla Glu Lys Thr Ala Leu Leu His Tyr Leu Ser Arg Arg 65 70 75 80 ttg cttttt att aca cct ggg atg aag tac gac ttg gaa ccg tgg atg 288 Leu Leu PheIle Thr Pro Gly Met Lys Tyr Asp Leu Glu Pro Trp Met 85 90 95 ctt gct cgcagg aag aca gat ttt aag ttt ttc acc aca ggc ttt ctg 336 Leu Ala Arg ArgLys Thr Asp Phe Lys Phe Phe Thr Thr Gly Phe Leu 100 105 110 att gcg gagaag ata tcc gta aag atg gct ctc cgc tcg atg agc ttt 384 Ile Ala Glu LysIle Ser Val Lys Met Ala Leu Arg Ser Met Ser Phe 115 120 125 gag gtg tccttt tcg caa gtg cct tcg tct gtt cct ttt gtg cgg tct 432 Glu Val Ser PheSer Gln Val Pro Ser Ser Val Pro Phe Val Arg Ser 130 135 140 ccg gtt gttctc atg aat gcg tgt cgc gtg acc gtg acg gcc acc atc 480 Pro Val Val LeuMet Asn Ala Cys Arg Val Thr Val Thr Ala Thr Ile 145 150 155 160 atg gtggaa act att tct cgc agc agc gcc gtg acc caa ccc gtc tgc 528 Met Val GluThr Ile Ser Arg Ser Ser Ala Val Thr Gln Pro Val Cys 165 170 175 ctg agaagc atg ctc cgc gtg atg gtg tcg ccg gaa ctg tgg ccg atc 576 Leu Arg SerMet Leu Arg Val Met Val Ser Pro Glu Leu Trp Pro Ile 180 185 190 gtg tcgcag gga ctg tgt tac ttc ccc ggt tac cgt cgg ttg tcc tac 624 Val Ser GlnGly Leu Cys Tyr Phe Pro Gly Tyr Arg Arg Leu Ser Tyr 195 200 205 gct aacgtc gaa gag tgg gta ttt cat gtg cac ggg aag tac ggg gag 672 Ala Asn ValGlu Glu Trp Val Phe His Val His Gly Lys Tyr Gly Glu 210 215 220 tct catccc gag tgt ttc gga cag tgc aaa cag tgt tcg acg cgg caa 720 Ser His ProGlu Cys Phe Gly Gln Cys Lys Gln Cys Ser Thr Arg Gln 225 230 235 240 cctctc tct ctg ttc tgt tct gct cag ttg gct tat ctg cgc aat gtg 768 Pro LeuSer Leu Phe Cys Ser Ala Gln Leu Ala Tyr Leu Arg Asn Val 245 250 255 tttatg gaa cga cgc gcg aga gtc gct ggt gaa cgt ccg tat agc taa 816 Phe MetGlu Arg Arg Ala Arg Val Ala Gly Glu Arg Pro Tyr Ser 260 265 270 <210>SEQ ID NO 50 <211> LENGTH: 271 <212> TYPE: PRT <213> ORGANISM: CELOVIRUS <220> FEATURE: <223> OTHER INFORMATION:Position:32892..33707/note=ORF20 <400> SEQUENCE: 50 Met Glu Arg Leu AsnGlu Tyr Arg Ile Asn Arg Ala Val Ala Ser Leu 1 5 10 15 Arg Cys Phe AspAsn Asp Leu Met Arg Arg Leu His Ser Ser Val Thr 20 25 30 Val Leu Val ThrVal Arg Ser Ala Lys Phe Val Cys Phe Lys Arg Arg 35 40 45 Asp Tyr Val LeuMet Asn Cys Ile Val Arg Ile Val Ser Ala Leu His 50 55 60 Leu Asn Arg AlaGlu Lys Thr Ala Leu Leu His Tyr Leu Ser Arg Arg 65 70 75 80 Leu Leu PheIle Thr Pro Gly Met Lys Tyr Asp Leu Glu Pro Trp Met 85 90 95 Leu Ala ArgArg Lys Thr Asp Phe Lys Phe Phe Thr Thr Gly Phe Leu 100 105 110 Ile AlaGlu Lys Ile Ser Val Lys Met Ala Leu Arg Ser Met Ser Phe 115 120 125 GluVal Ser Phe Ser Gln Val Pro Ser Ser Val Pro Phe Val Arg Ser 130 135 140Pro Val Val Leu Met Asn Ala Cys Arg Val Thr Val Thr Ala Thr Ile 145 150155 160 Met Val Glu Thr Ile Ser Arg Ser Ser Ala Val Thr Gln Pro Val Cys165 170 175 Leu Arg Ser Met Leu Arg Val Met Val Ser Pro Glu Leu Trp ProIle 180 185 190 Val Ser Gln Gly Leu Cys Tyr Phe Pro Gly Tyr Arg Arg LeuSer Tyr 195 200 205 Ala Asn Val Glu Glu Trp Val Phe His Val His Gly LysTyr Gly Glu 210 215 220 Ser His Pro Glu Cys Phe Gly Gln Cys Lys Gln CysSer Thr Arg Gln 225 230 235 240 Pro Leu Ser Leu Phe Cys Ser Ala Gln LeuAla Tyr Leu Arg Asn Val 245 250 255 Phe Met Glu Arg Arg Ala Arg Val AlaGly Glu Arg Pro Tyr Ser 260 265 270 <210> SEQ ID NO 51 <211> LENGTH: 324<212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221>NAME/KEY: CDS <222> LOCATION: (1)..(324) <400> SEQUENCE: 51 atg tgc acggga agt acg ggg agt ctc atc ccg agt gtt tcg gac agt 48 Met Cys Thr GlySer Thr Gly Ser Leu Ile Pro Ser Val Ser Asp Ser 1 5 10 15 gca aac agtgtt cga cgc ggc aac ctc tct ctc tgt tct gtt ctg ctc 96 Ala Asn Ser ValArg Arg Gly Asn Leu Ser Leu Cys Ser Val Leu Leu 20 25 30 agt tgg ctt atctgc gca atg tgt tta tgg aac gac gcg cga gag tcg 144 Ser Trp Leu Ile CysAla Met Cys Leu Trp Asn Asp Ala Arg Glu Ser 35 40 45 ctg gtg aac gtc cgtata gct aat tac gtg ttt gat ttt gca gtg ttg 192 Leu Val Asn Val Arg IleAla Asn Tyr Val Phe Asp Phe Ala Val Leu 50 55 60 tgg acg cta ttg gcg cgagtt ctt ggc cct cct ggt cgc cct gtc cta 240 Trp Thr Leu Leu Ala Arg ValLeu Gly Pro Pro Gly Arg Pro Val Leu 65 70 75 80 cag cag cat cat cct gtgcag ctt cct gtt cct aca gaa cca tct gtc 288 Gln Gln His His Pro Val GlnLeu Pro Val Pro Thr Glu Pro Ser Val 85 90 95 ttc gtt aaa ctt tgt aat cagcgt gtt cgt ttg tag 324 Phe Val Lys Leu Cys Asn Gln Arg Val Arg Leu 100105 <210> SEQ ID NO 52 <211> LENGTH: 107 <212> TYPE: PRT <213> ORGANISM:CELO VIRUS <220> FEATURE: <223> OTHER INFORMATION:Position:32735..33058/note=ORF21 <400> SEQUENCE: 52 Met Cys Thr Gly SerThr Gly Ser Leu Ile Pro Ser Val Ser Asp Ser 1 5 10 15 Ala Asn Ser ValArg Arg Gly Asn Leu Ser Leu Cys Ser Val Leu Leu 20 25 30 Ser Trp Leu IleCys Ala Met Cys Leu Trp Asn Asp Ala Arg Glu Ser 35 40 45 Leu Val Asn ValArg Ile Ala Asn Tyr Val Phe Asp Phe Ala Val Leu 50 55 60 Trp Thr Leu LeuAla Arg Val Leu Gly Pro Pro Gly Arg Pro Val Leu 65 70 75 80 Gln Gln HisHis Pro Val Gln Leu Pro Val Pro Thr Glu Pro Ser Val 85 90 95 Phe Val LysLeu Cys Asn Gln Arg Val Arg Leu 100 105 <210> SEQ ID NO 53 <211> LENGTH:618 <212> TYPE: DNA <213> ORGANISM: CELO VIRUS <220> FEATURE: <221>NAME/KEY: CDS <222> LOCATION: (1)..(618) <400> SEQUENCE: 53 atg aac gacgag cag atc ctg gag atg gtg ctg cag cac cag cag cgc 48 Met Asn Asp GluGln Ile Leu Glu Met Val Leu Gln His Gln Gln Arg 1 5 10 15 cgc caa caggaa gcg gag cgc gag gag gaa gtt ggg gat gac atg gaa 96 Arg Gln Gln GluAla Glu Arg Glu Glu Glu Val Gly Asp Asp Met Glu 20 25 30 gac gac gaa gatgat gac ggt ctt cag atg ccg acg ccg ctt cat gcc 144 Asp Asp Glu Asp AspAsp Gly Leu Gln Met Pro Thr Pro Leu His Ala 35 40 45 tat cag cta ctg tgttac gat tct ttc gaa ctt cat ttc ggg gga tgc 192 Tyr Gln Leu Leu Cys TyrAsp Ser Phe Glu Leu His Phe Gly Gly Cys 50 55 60 gct tgc cac ggg tta cctttg cat cgt atg ggg tta tcg gct tgc cac 240 Ala Cys His Gly Leu Pro LeuHis Arg Met Gly Leu Ser Ala Cys His 65 70 75 80 ctg gct cct tcc gat ttggcc act tat gtt tgg gcc agg ttg gag gat 288 Leu Ala Pro Ser Asp Leu AlaThr Tyr Val Trp Ala Arg Leu Glu Asp 85 90 95 gac ttg aat gtg gca ggg gtgtac ttc gtg gct atg tgg gcg tca ccg 336 Asp Leu Asn Val Ala Gly Val TyrPhe Val Ala Met Trp Ala Ser Pro 100 105 110 ggg ttt agc gat ttc tct ccagta ttt atg cag cga ccg atc ggg aac 384 Gly Phe Ser Asp Phe Ser Pro ValPhe Met Gln Arg Pro Ile Gly Asn 115 120 125 gtg tgc ggg atg tta att cacgtg gac ctg cac agc agg cta cca ttc 432 Val Cys Gly Met Leu Ile His ValAsp Leu His Ser Arg Leu Pro Phe 130 135 140 cta att gcg gtg tcg cgc ttgggg gag gcg ggt ggc agc ccc tgt ctg 480 Leu Ile Ala Val Ser Arg Leu GlyGlu Ala Gly Gly Ser Pro Cys Leu 145 150 155 160 tat atg agg aaa att gatgtt gat ttg gac acg cag cgc gta cat ttt 528 Tyr Met Arg Lys Ile Asp ValAsp Leu Asp Thr Gln Arg Val His Phe 165 170 175 tat aca gaa gat tgg ttcagt gag ttt gcg aat ctg ctg tat tac tgg 576 Tyr Thr Glu Asp Trp Phe SerGlu Phe Ala Asn Leu Leu Tyr Tyr Trp 180 185 190 caa atg agc gaa tgg aaacat tta gcg gag cgt atg caa taa 618 Gln Met Ser Glu Trp Lys His Leu AlaGlu Arg Met Gln 195 200 205 <210> SEQ ID NO 54 <211> LENGTH: 205 <212>TYPE: PRT <213> ORGANISM: CELO VIRUS <220> FEATURE: <223> OTHERINFORMATION: Position:31812..32429/note=ORF22 <400> SEQUENCE: 54 Met AsnAsp Glu Gln Ile Leu Glu Met Val Leu Gln His Gln Gln Arg 1 5 10 15 ArgGln Gln Glu Ala Glu Arg Glu Glu Glu Val Gly Asp Asp Met Glu 20 25 30 AspAsp Glu Asp Asp Asp Gly Leu Gln Met Pro Thr Pro Leu His Ala 35 40 45 TyrGln Leu Leu Cys Tyr Asp Ser Phe Glu Leu His Phe Gly Gly Cys 50 55 60 AlaCys His Gly Leu Pro Leu His Arg Met Gly Leu Ser Ala Cys His 65 70 75 80Leu Ala Pro Ser Asp Leu Ala Thr Tyr Val Trp Ala Arg Leu Glu Asp 85 90 95Asp Leu Asn Val Ala Gly Val Tyr Phe Val Ala Met Trp Ala Ser Pro 100 105110 Gly Phe Ser Asp Phe Ser Pro Val Phe Met Gln Arg Pro Ile Gly Asn 115120 125 Val Cys Gly Met Leu Ile His Val Asp Leu His Ser Arg Leu Pro Phe130 135 140 Leu Ile Ala Val Ser Arg Leu Gly Glu Ala Gly Gly Ser Pro CysLeu 145 150 155 160 Tyr Met Arg Lys Ile Asp Val Asp Leu Asp Thr Gln ArgVal His Phe 165 170 175 Tyr Thr Glu Asp Trp Phe Ser Glu Phe Ala Asn LeuLeu Tyr Tyr Trp 180 185 190 Gln Met Ser Glu Trp Lys His Leu Ala Glu ArgMet Gln 195 200 205

1. CELO virus obtained by in vitro manipulation of a plasmid-cloned CELOvirus DNA.
 2. CELO virus according to claim 1, characterised in that itcontains the left and right inverted terminal repeat and the packagingsignal and has modifications in the form of insertions and/or deletionsand/or mutations in other regions of the CELO virus DNA.
 3. CELO virusaccording to claim 2, characterised in that it contains modificationswhich are located on a section of the CELO virus DNA which comprises thenucleotides from about 201-about 5,000 and/or on a section whichcomprises the nucleotides from about 31,800-about 43,734 and/or on asection which comprises the nucleotides from about 28,114-30,495. 4.CELO virus DNA, contained on a plasmid which is replicatable in bacteriaor yeast and yields virus particles after being introduced into cells,optionally together with a plasmid which complements a gene which may bemissing from the CELO virus DNA and is necessary for the formation ofmature virus particles.
 5. CELO virus according to one of claims 2 to 4or CELO virus DNA derived therefrom containing foreign DNA.
 6. CELOvirus or CELO virus DNA according to claim 5, containing as foreign DNAa DNA which codes for a therapeutically active protein.
 7. CELO virus orCELO virus DNA according to claim 6, containing as foreign DNA a DNAwhich codes for an immunostimulatory protein.
 8. CELO virus or CELOvirus DNA according to claim 7, containing a DNA coding for a cytokine.9. CELO virus or CELO virus DNA according to claim 5, containing asforeign DNA a DNA which codes for a tumour antigen or a fragmentthereof.
 10. CELO virus or CELO virus DNA according to claim 5,containing as foreign DNA a DNA which codes for an antigen derived froma human pathogen.
 11. CELO virus or CELO virus DNA according to claim 5,containing as foreign DNA a DNA which codes for an antigen derived froman animal pathogen.
 12. CELO virus or CELO virus DNA according to claim11, containing as foreign DNA a DNA which codes for an antigen derivedfrom a bird pathogen.
 13. CELO virus or CELO virus DNA according toclaim 5, containing as foreign DNA a DNA which codes for a ligand formammalian cells.
 14. CELO virus or CELO virus DNA according to one ofclaims 5 to 13, characterised in that it contains the foreign DNA in theFseI cutting site at position 35,693 or beyond this cutting site orclose to this cutting site.
 15. CELO virus or CELO virus DNA accordingto one of claims 5 to 13, characterised in that it contains the foreignDNA on a section which comprises the nucleotides from about28,114-30,495, which contains the fibre 1 gene.
 16. CELO virus or CELOvirus DNA according to one of claims 5 to 13, characterised in that itcontains the foreign DNA in the region of the reading frame atnucleotide 794 which codes for dUTPase.
 17. Process for preparingrecombinant CELO virus or recombinant CELO virus DNA according to one ofclaims 2 to 16, characterised in that the CELO virus genome or sectionsthereof contained on a plasmid are genetically manipulated.
 18. Processaccording to claim 17, characterised in that the CELO virus genome orsections thereof contained on a plasmid are manipulated in a regionwhich is different from the left and right inverted terminal repeat andfrom the packaging signal.
 19. Process according to claim 17 or 18,characterised in that insertions and/or deletions are carried out. 20.Process according to claim 19, characterised in that a foreign gene isinserted in a naturally occurring or artificially introduced restrictioncutting site on a section of the CELO virus DNA which contains asequence which is not necessary for the replication of the virus in thehost cell or which can be complemented.
 21. Process according to claim20, characterised in that in addition to the insertion, a deletion iscarried out in another region of the CELO virus DNA which contains asequence which is not necessary for the replication of the virus in thehost cell or which can be complemented.
 22. Process according to claim17 or 8, characterised in that the manipulation is carried out byrecombination.
 23. Process according to claim 22, characterised in thatDNA molecules obtained by polymerase chain reaction are recombined. 24.Process according to claim 22, characterised in that DNA moleculesobtained by ligation are recombined.
 25. Process according to claim 22,characterised in that DNA molecules obtained by cloning in bacteria arerecombined.
 26. Process for preparing recombinant CELO virus DNAaccording to claim 25, characterised in that a CELO virus DNA fragmentcontaining two restriction sites is cloned into a bacterial plasmid, theforeign DNA is inserted between these restriction sites which occur onlyonce on the plasmid, the fragment containing the foreign DNA is excisedfrom the plasmid and mixed with a plasmid which contains the completeCELO virus DNA and which has been cut at a restriction cutting sitewhich occurs only once, and in that bacteria are transformed with thismixture of DNA molecules and the bacteria are grown, whilst byrecombination of the DNA molecules a plasmid is obtained which containsthe entire CELO virus DNA with the foreign DNA as an insert.
 27. Processaccording to one of claims 20 to 26, characterised in that a reportergene is inserted as the foreign DNA.
 28. Process according to claim 27,characterised in that the reporter gene has a restriction cutting siteinto which a foreign DNA as defined in one of claims 6 to 13 is insertedin an additional step.
 29. Process for preparing CELO virus,characterised in that avian cells are transformed with a plasmidcontaining CELO virus or DNA derived therefrom, as defined in one ofclaims 1 to 16, the cells are cultivated and the CELO virus particlesare harvested.
 30. Process according to claim 29, characterised in thatthe CELO virus DNA lacks sequences which are necessary for the formationof mature virus particles and the missing DNA sequences arecomplemented.
 31. Process according to claim 30, characterised in thatthe avian cells used are helper cells which complement the missing geneor genes.
 32. Helper cells containing CELO virus genes integrated intheir genome.
 33. Helper cells according to claim 32, characterised inthat the cells are avian cells.
 34. Process according to claim 30,characterised in that the cells are also transformed with a plasmidwhich complements the gene or genes missing from the CELO virus DNA andnecessary for the formation of mature virus particles.
 35. Processaccording to claim 30, characterised in that the cells are also infectedwith a helper virus which complements the gene or genes missing from theCELO virus DNA and necessary for the formation of mature virusparticles.
 36. Pharmaceutical composition for use in gene therapy,containing a CELO virus according to claim
 6. 37. Vaccine againstinfectious diseases in humans, containing a CELO virus according toclaim
 10. 38. Vaccine against infectious diseases in animals, containinga CELO virus according to claim
 11. 39. Vaccine against infectiousdiseases in birds, containing a CELO virus according to claim
 12. 40.CELO virus according to claim 7, 8 or 9 for the production of cancervaccines.